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TRANSPORTERS AND ION CHANNELS 
TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of transporters and ion channels 
5 and to the use of these sequences in the diagnosis, treatment, and prevention of transport, neurological, 
muscle, immunological, and cell proliferative disorders, and in the assessment of the effects of 
exogenous compounds on the expression of nucleic acid and amino acid sequences of transporters and 
ion channels. 

10 BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes which are highly impermeable to most polar molecules. Cells and 
organelles require transport proteins to import and export essential nutrients and metal ions including 
K + , NH 4 + , P i7 S0 4 2 \ sugars, and vitamins, as well as various metabolic waste products. Transport 

15 proteins also play roles in antibiotic resistance, toxin secretion, ion balance, synaptic neurotransmission, 
kidney function, intestinal absorption, tumor growth, and other diverse cell functions (Griffith, J. and C. 
Sansom (1998) The Transporter Facts Book , Academic Press, San Diego CA, pp. 3-29). Transport 
can occur by a passive concentration-dependent mechanism, or can be linked to an energy source such 
as ATP hydrolysis or an ion gradient. Proteins that function in transport include carrier proteins, which 

20 bind to a specific solute and undergo a conformational change that translocates the bound solute across 
the membrane, and channel proteins, which form hydrophilic pores that allow specific solutes to diffuse 
through the membrane down an electrochemical solute gradient. 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 

25 simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (antiport). For example, intestinal and kidney epithelium contains a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 

30 Na + /K + ATPase system. Sodium-coupled transporters include the mammalian glucose transporter 

(SGLT1), iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have 
twelve putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically- 
oriented N- and C-termini. NIS plays a crucial role in the evaluation, diagnosis, and treatment of 
various thyroid pathologies because it is the molecular basis for radioiodide thyroid-imaging 
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techniques and for specific targeting of radioisotopes to the thyroid gland (Levy, O. et al. (1997) Proc. 
Natl. Acad. Sci. USA 94:5568-5573). SMVT is expressed in the intestinal mucosa, kidney, and 
placenta, and is implicated in the transport of the water-soluble vitamins, e.g., biotin and pantothenate 
(Prasad, P.D. et al. (1998) J. Biol. Chem. 273:7501-7506). 
5 One of the largest families of transporters is the major facilitator superfamily (MFS), also 

called the uniporter-symporter- antiporter family. MFS transporters are single polypeptide carriers 
that transport small solutes in response to ion gradients. Members of the MFS are found in all classes 
of living organisms, and include transporters for sugars, oligosaccharides, phosphates, nitrates, 
nucleosides, monocarboxylates, and drugs. MFS transporters found in eukaryotes all have a structure 

10 comprising 12 transmembrane segments (Pao, S.S. et al. (1998) Microbiol. Molec. Biol. Rev. 62:1- 
34). The largest family of MFS transporters is the sugar transporter family, ^yhich includes the seven 
glucose transporters (GLUT1 -GLUT7) found in humans that are required for the transport of glucose 
and other hexose sugars. These glucose transport proteins have unique tissue distributions and 
physiological functions. GLUT1 provides many cell types with their basal glucose requirements and 

15 transports glucose across epithelial and endothelial barrier tissues; GLUT2 facilitates glucose uptake 
or efflux from the liver; GLUT3 regulates glucose supply to neurons; GLUT4 is responsible for 
insulin-regulated glucose disposal; and GLUTS regulates fructose uptake into skeletal muscle. 
Defects in glucose transporters are involved in a recently identified neurological syndrome causing 
infantile seizures and developmental delay, as well as glycogen storage disease, Fanconi-Bickel 

20 syndrome, and non-insulin-dependent diabetes mellitus (Mueckler, M. (1994) Eur. J. Biochem. 
219:713-725; Longo, N. andLJ. Elsas (1998) Adv. Pediatr. 45:293-313). 

Monocarboxylate anion transporters are proton-coupled symporters with a broad substrate 
specificity that includes L-lactate, pyruvate, and the ketone bodies acetate, acetoacetate, and 
beta-hydroxybutyrate. At least seven isoforms have been identified to date. The isoforms are predicted 

25 to have twelve transmembrane (TM) helical domains with a large intracellular loop between TM6 and 
TM7, and play a critical role in maintaining intracellular pH by removing the protons that are produced 
stoichiometrically with lactate during glycolysis. The best characterized H + -monocarboxylate 
transporter is that of the erythrocyte membrane, which transports L-lactate and a wide range of other 
aliphatic monocarboxylates. Other cells possess H + -linked monocarboxylate transporters with differing 

30 substrate and inhibitor selectivities. In particular, cardiac muscle and tumor cells have transporters that 
differ in their K m values for certain substrates, including stereoselectivity for L- over D-lactate, and in 
their sensitivity to inhibitors. There are Na + -monocarboxylate cotransporters on the luminal surface of 
intestinal and kidney epithelia, which allow the uptake of lactate, pyruvate, and ketone bodies in these 
tissues. In addition, there are specific and selective transporters for organic cations and organic anions 
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in organs including the kidney, intestine and liver. Organic anion transporters are selective for 
hydrophobic, charged molecules with electron-attracting side groups. Organic cation transporters, such 
as the ammonium transporter, mediate the secretion of a variety of drugs and endogenous metabolites, 
and contribute to the maintenance of intercellular pH (Poole, R.C. and A.P. Halestrap (1993) Am. J. 
5 Physiol. 264:C761-C782; Price, N.T. et al. (1998) Biochem. J. 329:321-328; and Martinelle, K. and I. 
Haggstrom (1993) J. Biotechnol. 30:339-350). 

ATP-binding cassette (ABC) transporters are members of a superfamily of membrane proteins 
that transport substances ranging from small molecules such as ions, sugars, amino acids, peptides, and 
phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC transporters 

10 consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 
energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the case 
for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 

15 homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 

histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects in 
ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia 

20 (sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 

ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used in 
chemotherapy (Taglicht, D. and S. Michaelis (1998) Meth. Enzymol. 292:130-162). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, selenium, 
nickel, and chromium are important as cofactors for a number of enzymes. For example, copper is 

25 involved in hemoglobin synthesis, connective tissue metabolism, and bone development, by acting as a 
cofactor in oxidoreductases such as superoxide dismutase, ferroxidase (ceruloplasmin), and lysyl 
oxidase. Copper and other metal ions must be provided in the diet, and are absorbed by transporters in 
the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and other target organs, 
where specific transporters move the ions into cells and cellular organelles as needed. Imbalances in 

30 metal ion metabolism have been associated with a number of disease states (Danks, D.M. (1986) J. 
Med. Genet. 23:99-106). 

Transport of fatty acids across the plasma membrane can occur by diffusion, a high capacity, 
low affinity process. However, under normal physiological conditions a significant fraction of fatty 
acid transport appears to occur via a high affinity, low capacity protein-mediated transport process. 
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Fatty acid transport protein (FATP), an integral membrane protein with four transmembrane segments, 
is expressed in tissues exhibiting high levels of plasma membrane fatty acid flux, such as muscle, heart, 
and adipose. Expression of FATP is upregulated in 3T3-L1 cells during adipose conversion, and 
expression in COS7 fibroblasts elevates uptake of long-chain fatty acids (Hui, T.Y. et aL (1998) J. 
5 Biol. Chem. 273:27420-27429). 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the pyruvate 
carrier; the dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the 

10 tricarboxylate carrier which transports citrate and malate; and the Grave's disease carrier protein, a 
protein recognized by IgG inpatients with active Grave's disease, an autoimmune disorder resulting in 
hyperthyroidism. Proteins in this family consist of three tandem repeats of an approximately 100 amino 
acid domain, each of which contains two transmembrane regions (Stryer, L. (1995) Biochemistry , W.H. 
Freeman and Company, New York NY, p. 551 ; PROSITE PDOC00189 Mitochondrial energy transfer 

15 proteins signature; Online Mendelian Inheritance in Man (OMIM) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation from 
ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling proteins 
have been implicated as modulators of thermoregulation and metabolic rate, and have been proposed as 

20 potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. (1999) J. Int. 
Med. 245:637-642). 
Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form ion- 

25 selective pores within the membrane. There are two basic types of ion channels, ion transporters and 
gated ion channels. Ion transporters utilize the energy obtained from ATP hydrolysis to actively 
transport an ion against the ion's concentration gradient. Gated ion channels allow passive flow of an 
ion down the ion's electrochemical gradient under restricted conditions. Together, these types of ion 
channels generate, maintain, and utilize an electrochemical gradient that is used in 1) electrical impulse 

30 conduction down the axon of a nerve cell, 2) transport of molecules into cells against concentration 
gradients, 3) initiation of muscle contraction, and 4) endocrine cell secretion. 
Ion Transporters 

Ion transporters generate and maintain the resting electrical potential of a cell. Utilizing the 
energy derived from ATP hydrolysis, they transport ions against the ion's concentration gradient. 
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These transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ - ATPase, and H + - ATPase, are activated by a 
phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of Na + and Ca 2+ are low and cytosolic concentration of 
5 K + is high. The vacuolar (V) class of ion transporters includes H + pumps on intracellular organelles, 
such as lysosomes and Golgi. V-class ion transporters are responsible for generating the low pH within 
the lumen of these organelles that is required for function. The coupling factor (F) class consists of H + 
pumps in the mitochondria. F-class ion transporters utilize a proton gradient to generate ATP from 
ADP and inorganic phosphate (P 4 ). 

10 The P- ATPases are hexamers of a 100 kD subunit with ten transmembrane domains and 

several large cytoplasmic regions that may play a role in ion binding (Scarborough, G. A. (1999) Curr. 
Opin. Cell Biol. 1 1:517-522). The V- ATPases are composed of two functional domains: the V x 
domain, a peripheral complex responsible for ATP hydrolysis; and the V 0 domain, an integral 
complex responsible for proton translocation across the membrane. The F- ATPases are structurally 

15 and evolutionarily related to the V- ATPases. The F- ATPase F 0 domain contains 12 copies of the c 
subunit, a highly hydrophobic protein composed of two transmembrane domains and containing a 
single buried carboxyl group in TM2 that is essential for proton transport. The V-ATPase V 0 domain 
contains three types of homologous c subunits with four or five transmembrane domains and the 
essential carboxyl group in TM4 or TM3. Both types of complex also contain a single a subunit that 

20 may be involved in regulating the pH dependence of activity (Forgac, M. (1999) J. Biol. Chem. 
274:12951-12954). 

The resting potential of the cell is utilized in many processes involving carrier proteins and 
gated ion channels. Carrier proteins utilize the resting potential to transport molecules into and out of 
the cell. Amino acid and glucose transport into many cells is linked to sodium ion co-transport 
25 (symport) so that the movement of Na + down an electrochemical gradient drives transport of the other 
molecule up a concentration gradient. Similarly, cardiac muscle links transfer of Ca 2+ out of the cell 
with transport of Na + into the cell (antiport). 
Gated Ion Channels 

Gated ion channels control ion flow by regulating the opening and closing of pores. The 
30 ability to control ion flux through various gating mechanisms allows ion channels to mediate such 
diverse signaling and homeostatic functions as neuronal and endocrine signaling, muscle contraction, 
fertilization, and regulation of ion and pH balance. Gated ion channels are categorized according to 
the manner of regulating the gating function. Mechanically-gated channels open their pores in 
response to mechanical stress; voltage-gated channels (e.g., Na + , K*, Ca 2+ , and CI" channels) open 
35 their pores in response to changes in membrane potential; and ligand-gated channels (e.g., 
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acetylcholine-, serotonin-, and glutamate-gated cation channels, and GABA- and glycine-gated 
chloride channels) open their pores in the presence of a specific ion, nucleotide, or neurotransmitter. 
The gating properties of a particular ion channel (i.e., its threshold for and duration of opening and 
closing) are sometimes modulated by association with auxiliary channel proteins and/or post 
5 translational modifications, such as phosphorylation. 

Mechanically-gated or mechanosensitive ion channels act as transducers for the senses of 
touch, hearing, and balance, and also play important roles in cell volume regulation, smooth muscle 
contraction, and cardiac rhythm generation. A stretch-inactivated channel (SIC) was recently cloned 
from rat kidney. The SIC channel belongs to a group of channels which are activated by pressure or 
10 stress on the cell membrane and conduct both Ca 2+ and Na + (Suzuki, M. et al. (1999) J. Biol. Chem. 
274:6330-6335). 

The pore-forming subunits of the voltage-gated cation channels form a superfamily of ion 
channel proteins. The characteristic domain of these channel proteins comprises six transmembrane 
domains (S1-S6), a pore-forming region (P) located between S5 and S6, and intracellular amino and 

1 5 carboxy termini. In the Na + and Ca 2+ subfamilies, this domain is repeated four times, while in the K + 
channel subfamily, each channel is formed from a tetramer of either identical or dissimilar subunits. 
The P region contains information specifying the ion selectivity for the channel. In the case of K + 
channels, a GYG tripeptide is involved in this selectivity (Ishii, T.M. et al. (1997) Proc. Natl. Acad. 
Sci. USA 94:11651-11656). 

20 Voltage-gated Na + and K + channels are necessary for the function of electrically excitable cells, 

such as nerve and muscle cells. Action potentials, which lead to neurotransmitter release and muscle 
contraction, arise from large, transient changes in the permeability of the membrane to Na + and K + 
ions. Depolarization of the membrane beyond the threshold level opens voltage-gated Na + channels. 
Sodium ions flow into the cell, further depolarizing the membrane and opening more voltage-gated Na + 

25 channels, which propagates the depolarization down the length of the cell. Depolarization also opens 
voltage-gated potassium channels. Consequently, potassium ions flow outward, which leads to 
repolarization of the membrane. Voltage-gated channels utilize charged residues in the fourth 
transmembrane segment (S4) to sense voltage change. The open state lasts only about 1 millisecond, at 
which time the channel spontaneously converts into an inactive state that cannot be opened irrespective 

30 of the membrane potential. Inactivation is mediated by the channel's N-terminus, which acts as a plug 
that closes the pore. The transition from an inactive to a closed state requires a return to resting 
potential. 

Voltage-gated Na + channels are heterotrimeric complexes composed of a 260 kDa pore-forming 
a subunit that associates with two smaller auxiliary subunits, (31 and J32. The J32 subunit is a integral 
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membrane glycoprotein that contains an extracellular Ig domain, and its association with a and pi 
subunits correlates with increased functional expression of the channel, a change in its gating 
properties, as well as an increase in whole cell capacitance due to an increase in membrane surface area 
(Isom, L.L. et al. (1995) Cell 83:433-442). 
5 Non voltage-gated Na + channels include the members of the amiloride-sensitive Na + 

channel/degenerin (NaC/DEG) family. Channel subunits of this family are thought to consist of two 
transmembrane domains flanking a long extracellular loop, with the amino and carboxyl termini located 
within the cell. The NaC/DEG family includes the epithelial Na + channel (ENaC) involved in Na + 
reabsorption in epithelia including the airway, distal colon, cortical collecting duct of the kidney, and 

10 exocrine duct glands. Mutations in ENaC result in pseudohypoaldosteronism type 1 and Liddle's 

syndrome (pseudohyperaldosteronism). The NaC/DEG family also includes the recently characterized 
H*-gated cation channels or acid-sensing ion channels (ASIC). ASIC subunits are expressed in the 
brain and form heteromultimeric Na + -permeable channels. These channels require acid pH fluctuations 
for activation. ASIC subunits show homology to the degenerins, a family of mechanically-gated 

15 channels originally isolated from C. elegans. Mutations in the degenerins cause neurodegeneration. 
ASIC subunits may also have a role in neuronal function, or in pain perception, since tissue acidosis 
causes pain (Waldmann, R. and M. Lazdunski (1998) Curr. Opin. Neurobiol. 8:418-424; Eglen, R.M. 
et al. (1999) Trends Pharmacol. Sci. 20:337-342). 

K + channels are located in all cell types, and may be regulated by voltage, ATP concentration, 

20 or second messengers such as Ca 2+ and cAMP. In non-excitable tissue, K + channels are involved in 
protein synthesis, control of endocrine secretions, and the maintenance of osmotic equilibrium across 
membranes. In neurons and other excitable cells, in addition to regulating action potentials and 
repolarizing membranes, K + channels are responsible for setting resting membrane potential. The 
cytosol contains non-diffusible anions and, to balance this net negative charge, the cell contains a Na + - 

25 K + pump and ion channels that provide the redistribution of Na + , K + , and Cl~. The pump actively 

transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion channels in the plasma membrane 
allow K + and CI " to flow by passive diffusion. Because of the high negative charge within the cytosol, 
CI ' flows out of the cell. The flow of K + is balanced by an electromotive force pulling K + into the cell, 
and a K + concentration gradient pushing K + out of the cell. Thus, the resting membrane potential is 

30 primarily regulated by K + flow (Salkoff, L. and T. Jegla (1995) Neuron 154S9-492). 

Potassium channel subunits of the Shaker -like superf amily all have the characteristic six 
transmembrane/1 pore domain structure. Four subunits combine as homo- or heterotetramers to form 
functional K channels. These pore-forming subunits also associate with various cytoplasmic p 
subunits that alter channel inactivation kinetics. The Shaker -like channel family includes the voltage- 
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gated K + channels as well as the delayed rectifier type channels such as the human ether-a-go-go 
related gene (HERG) associated with long QT, a cardiac dysrythmia syndrome (Curran, M.E. (1998) 
Cuir. Opin. Biotechnol. 9:565-572; Kaczorowski, G.J. and M.L. Garcia (1999) Curr. Opin. Chem. 
Biol. 3:448-458). 

5 A second superfamily of K + channels is composed of the inward rectifying channels (Kir). 

Kir channels have the property of preferentially conducting K* currents in the inward direction. These 
proteins consist of a single potassium selective pore domain and two transmembrane domains, which 
correspond to the fifth and sixth transmembrane domains of voltage-gated K + channels. Kir subunits 
also associate as tetramers. The Kir family includes ROMK1, mutations in which lead to Bartter 

10 syndrome, a renal tubular disorder. Kir channels are also involved in regulation of cardiac pacemaker 
activity, seizures and epilepsy, and insulin regulation (Doupnik, C.A. et al. (1995) Curr. Opin. 
Neurobiol. 5:268-277; Curran, supra) . 

The recently recognized TWIK K + channel family includes the mammalian TWIK-1 , TREK- 1 
and TASK proteins. Members of this family possess an overall structure with four transmembrane 

15 domains and two P domains. These proteins are probably involved in controlling the resting potential 
in a large set of cell types (Duprat, F. et al. (1997) EMBO J 16:5464-5471). 

The voltage-gated Ca 2+ channels have been classified into several subtypes based upon their 
electrophysiological and pharmacological characteristics. L-type Ca 2+ channels are predominantly 
expressed in heart and skeletal muscle where they play an essential role in excitation-contraction 

20 coupling. T-type channels are important for cardiac pacemaker activity, while N-type and P/Q-type 
channels are involved in the control of neurotransmitter release in the central and peripheral nervous 
system. The L-type and N-type voltage-gated Ca 2+ channels have been purified and, though their 
functions differ dramatically, they have similar subunit compositions. The channels are composed of 
three subunits. The a x subunit forms the membrane pore and voltage sensor, while the ct^S and P 

25 subunits modulate the voltage-dependence, gating properties, and the current amplitude of the channel. 
These subunits are encoded by at least six a l9 one c^S, and four P genes. A fourth subunit, y, has been 
identified in skeletal muscle (Walker, D. et al. (1998) J. Biol. Chem. 273:2361-2367; McCleskey, E.W. 
(1994) Curr. Opin. Neurobiol. 4:304-312). 

The transient receptor family (Trp) of calcium ion channels are thought to mediate capacitative 

30 calcium entry (CCE). CCE is the Ca 2+ influx into cells to resupply Ca 2+ stores depleted by the action of 
inositol triphosphate (IP3) and other agents in response to numerous hormones and growth factors. Trp 
and Trp-like were first cloned from Drosophila and have similarity to voltage gated Ca2+ channels in 
the S3 through S6 regions. This suggests that Trp and/or related proteins may form mammalian CCC 
entry channels (Zhu, X. et al. (1996) Cell 85:661-671; Boulay, G. et al. (1997) J. Biol. Chem. 
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272:29672-29680). Melastatin is a gene isolated in both the mouse and human, and whose expression 
in melanoma cells is inversely correlated with melanoma aggressiveness in vivo. The human cDNA 
transcript corresponds to a 1533-amino acid protein having homology to members of the Trp family. It 
has been proposed that the combined use of malastatin niRNA expression status and tumor thickness 
5 might allow for the determination of subgroups of patients at both low and high risk for developing 
metastatic disease (Duncan, L.M. et al (2001) J. Clin. Oncol. 19:568-576). 

Chloride channels are necessary in endocrine secretion and in regulation of cytosolic and 
organelle pH. In secretory epithelial cells, CV enters the cell across a basolateral membrane through an 
Na + , K7C1 * cotransporter, accumulating in the cell above its electrochemical equilibrium concentration. 

1 0 Secretion of CI " from the apical surface, in response to hormonal stimulation, leads to flow of Na + and 
water into the secretory lumen. The cystic fibrosis transmembrane conductance regulator (CFTR) is a 
chloride channel encoded by the gene for cystic fibrosis, a common fatal genetic disorder in humans. 
CFTR is a member of the ABC transporter family, and is composed of two domains each consisting of 
six transmembrane domains followed by a nucleotide-binding site. Loss of CFTR function decreases 

15 transepithelial water secretion and, as a result, the layers of mucus that coat the respiratory tree, 
pancreatic ducts, and intestine are dehydrated and difficult to clear. The resulting blockage of these 
sites leads to pancreatic insufficiency, "meconium ileus", and devastating "chronic obstructive 
pulmonary disease" (Al-Awqati, Q. et al. (1992) J. Exp. Biol. 172:245-266). 

The voltage-gated chloride channels (CLC) are characterized by 10-12 transmembrane 

20 domains, as well as two small globular domains known as CBS domains. The CLC subunits probably 
function as homotetramers. CLC proteins are involved in regulation of cell volume, membrane 
potential stabilization, signal transduction, and transepithelial transport. Mutations in CLC-1, 
expressed predominantly in skeletal muscle, are responsible for autosomal recessive generalized 
myotonia and autosomal dominant myotonia congenita, while mutations in the kidney channel CLC -5 

25 lead to kidney stones (Jentsch, T.J. (1996) Curr. Opin. Neurobiol. 6:303-310). 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds to 
the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter binds to 
their extracellular domain. These channels exist in the postsynaptic membrane of nerve or muscle cells. 
There are two types of neurotransmitter-gated channels. Sodium channels open in response to 

30 excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This opening causes an 
influx of Na + and produces the initial localized depolarization that activates the voltage-gated channels 
and starts the action potential. Chloride channels open in response to inhibitory neurotransmitters, such 
as y-aminobutyric acid (GAB A) and glycine, leading to hyperpolarization of the membrane and the 
subsequent generation of an action potential. Neurotransmitter-gated ion channels have four 

9 



BNSDOCID: <WO 0204520A2J_> 



WO 02/04520 



PCT/US01/21448 



transmembrane domains and probably function as pentamers (Jentsch, supra) . Amino acids in the 
second transmembrane domain appear to be important in determining channel permeation and 
selectivity (Sather, W.A. et al. (1994) Curr. Opin. Neurobiol. 4:313-323). 

Ligand-gated channels can be regulated by intracellular second messengers. For example, 
5 calcium-activated K* channels are gated by internal calcium ions. In nerve cells, an influx of calcium 
during depolarization opens K + channels to modulate the magnitude of the action potential (Ishi et al., 
supra) . The large conductance (BK) channel has been purified from brain and its subunit composition 
determined. The a subunit of the BK channel has seven rather than six transmembrane domains in 
contrast to voltage-gated K + channels. The extra transmembrane domain is located at the subunit N- 

10 terminus. A 28-amino-acid stretch in the C-terminal region of the subunit (the "calcium bowl" region) 
contains many negatively charged residues and is thought to be the region responsible for calcium 
binding. The P subunit consists of two transmembrane domains connected by a glycosylated 
extracellular loop, with intracellular N- and C-termini (Kaczorowski, supra ; Vergara, C. et al. (1998) 
Curr. Opin. Neurobiol. 8:321-329). 

15 Cyclic nucleotide-gated (CNG) channels are gated by cytosolic cyclic nucleotides. The best 

examples of these are the cAMP-gated Na + channels involved in olfaction and the cGMP-gated cation 
channels involved in vision. Both systems involve ligand-mediated activation of a G-protein coupled 
receptor which then alters the level of cyclic nucleotide within the cell. CNG channels also represent a 
major pathway for Ca 2+ entry into neurons, and play roles in neuronal development and plasticity. 

20 CNG channels are tetramers containing at least two types of subunits, an a subunit which can form 
functional homomeric channels, and a p subunit, which modulates the channel properties. All CNG 
subunits have six transmembrane domains and a pore forming region between the fifth and sixth 
transmembrane domains, similar to voltage-gated K + channels. A large C-terminal domain contains a 
cyclic nucleotide binding domain, while the N-terminal domain confers variation among channel 

25 subtypes (Zufall, F. et al. (1997) Curr. Opin. Neurobiol. 7:404-412). 

The activity of other types of ion channel proteins may also be modulated by a variety of 
intracellular signalling proteins. Many channels have sites for phosphorylation by one or more protein 
kinases including protein kinase A, protein kinase C, tyrosine kinase, and casein kinase II, all of 
which regulate ion channel activity in cells. Kir channels are activated by the binding of the Gpy 

30 subunits of heterotrimeric G-proteins (Reimann, F. and F.M. Ashcroft (1999) Curr. Opin. Cell. Biol. 
1 1 :503-508). Other proteins are involved in the localization of ion channels to specific sites in the cell 
membrane. Such proteins include the PDZ domain proteins known as MAGUKs (membrane-associated 
guanylate kinases) which regulate the clustering of ion channels at neuronal synapses (Craven, S.E. and 
D.S. Bredt (1998) Cell 93:495-498). 
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Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g., cystic fibrosis, glucose-galactose 
5 malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, andFanconi disease (van't Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332:1475-1480). 

10 Human diseases caused by mutations in ion channel genes include disorders of skeletal muscle, 

cardiac muscle, and the central nervous system. Mutations in the pore-forming subunits of sodium and 
chloride channels* cause myotonia, a muscle disorder in which relaxation after voluntary contraction is 
delayed. Sodium channel myotonias have been treated with channel blockers. Mutations in muscle 
sodium and calcium channels cause forms of periodic paralysis, while mutations in the sarcoplasmic 

15 calcium release channel, T-tubule calcium channel, and muscle sodium channel cause malignant 

hyperthermia. Cardiac arrythmia disorders such as the long QT syndromes and idiopathic ventricular 
fibrillation are caused by mutations in potassium and sodium channels (Cooper, E.C. and L. Y. Jan 
(1998) Proc. Natl. Acad. Sci. USA 96:4759-4766). All four known human idiopathic epilepsy genes 
code for ion channel proteins (Berkovic, S.F. and I.E. Scheffer (1999) Curr. Opin. Neurology 12:177- 

20 182). Other neurological disorders such as ataxias, hemiplegic migraine and hereditary deafness can 
also result from mutations in ion channel genes (Jen, J. (1999) Curr. Opin. Neurobiol. 9:274-280; 
Cooper, supra) . 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 

25 Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 
and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
Various classes of ion channels also play an important role in the perception of pain, and thus are 
potential targets for new analgesics. These include the vanilloid-gated ion channels, which are activated 
by the vanilloid capsaicin, as well as by noxious heat. Local anesthetics such as lidocaine and 

30 mexiletine which blockade voltage-gated Na + channels have been useful in the treatment of neuropathic 
pain (Eglen, supra) . 

Ion channels in the immune system have recently been suggested as targets for 
immunomodulation. T-cell activation depends upon calcium signaling, and a diverse set of T-cell 
specific ion channels has been characterized that affect this signaling process. Channel blocking agents 
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can inhibit secretion of lymphokines, cell proliferation, and killing of target cells. A peptide antagonist 
of the T-cell potassium channel Kvl.3 was found to suppress delayed-type hypersensitivity and 
allogenic responses in pigs, validating the idea of channel blockers as safe and efficacious 
immunosuppressants (Cahalan, M.D. and K.G. Chandy (1997) Curr. Opin. Biotechnol. 8:749-756). 
5 The discovery of new transporters and ion channels, and the polynucleotides encoding them, 

satisfies a need in the art by providing new compositions which are useful in the diagnosis, prevention, 
and treatment of transport, neurological, muscle, immunological, and cell proliferative disorders, and in 
the assessment of the effects of exogenous compounds on the expression of nucleic acid and amino acid 
sequences of transporters and ion channels. 

10 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, transporters and ion channels, referred to 
collectively as "TRICH" and individually as 'TRICH-1," 'TRICH-2," "TRICH-3," 'TRICH-4," 
'TRICH-5," "TRICH-6," "TRICH-7," 'TRICH-8," 'TRICH-9 "TRICR-IO," "TRICH-1 1," 

15 "TRICH-12," "TRICH-13," "TRICH-14," "TRICH-15," "TRICH-16," "TRICH-17," "TRICH-18," 
"TRICH-19 " "TRICH-20 " "TRICH-2V "TRICH-22," "TRICH-23," "TRICH-24," "TRICH-25," 
"TRICH-26," "TRICH-27," 'TRICH-28," 'TRICH-29," "TRICH-30," "TRICH-31," and "TRICH- 
32." In one aspect, the invention provides an isolated polypeptide selected from the group consisting of 
a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID 

20 NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, c) a biologically 
active fragment of a polypeptide having an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. In one alternative, the invention provides an 

25 isolated polypeptide comprising the amino acid sequence of SEQ ID NO:l-32. 

The invention further provides an isolated polynucleotide encoding a polypeptide selected from 
the group consisting of a) a polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence 
at least 90% identical to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 

30 32, c) a biologically active fragment of a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-32. In one alternative, the 
polynucleotide encodes a polypeptide selected from the group consisting of SEQ ID NO: 1-32. In 
another alternative, the polynucleotide is selected from the group consisting of SEQ ID NO:33-64. 
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Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide selected from the group consisting 
of a) a polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
5 to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, c) a biologically 
active fragment of a polypeptide having an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-32, and d) an immunogenic fragment of a polypeptide having an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-32. In one alternative, the invention provides a cell 
transformed with the recombinant polynucleotide. In another alternative, the invention provides a 

10 transgenic organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide selected from the group 
consisting of a) a polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO.1-32, b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% 
identical to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, c) a 

15 biologically active fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-32, and d) an immunogenic fragment of a polypeptide having an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 -32. The method comprises a) 
culturing a cell under conditions suitable for expression of the polypeptide, wherein said cell is 
transformed with a recombinant polynucleotide comprising a promoter sequence operably linked to a 

20 polynucleotide encoding the polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-32, b) a polypeptide comprising a naturally 
occurring amino acid sequence at least 90% identical to an amino acid sequence selected from the group 

25 consisting of SEQ ID NO:l-32, c) a biologically active fragment of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ED NO: 1-32, and d) an immunogenic fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

The invention further provides an isolated polynucleotide selected from the group consisting of 
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 

30 NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide 
comprises at least 60 contiguous nucleotides. 
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Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide selected from the group consisting of 
a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
5 identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 
polynucleotide of b), and e) an RN A equivalent of a)-d). The method comprises a) hybridizing the 
sample with a probe comprising at least 20 contiguous nucleotides comprising a sequence 
complementary to said target polynucleotide in the sample, and which probe specifically hybridizes to 

10 said target polynucleotide, under conditions whereby a hybridization complex is formed between said 
probe and said target polynucleotide or fragments thereof, and b) detecting the presence or absence of 
said hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 

15 target polynucleotide having a sequence of a polynucleotide selected from the group consisting of a) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, c) a 
polynucleotide complementary to the polynucleotide of a), d) a polynucleotide complementary to the 

20 polynucleotide of b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said 
target polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) 
detecting the presence or absence of said amplified target polynucleotide or fragment thereof, and, 
optionally, if present, the amount thereof. 

The invention farther provides a composition comprising an effective amount of a polypeptide 

25 selected from the group consisting of a) a polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a naturally occurring amino acid 
sequence at least 90% identical to an amino acid sequence selected from the group consisting of SEQ 
ID NO: 1 -32, c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic fragment of a polypeptide 

30 having an amino acid sequence selected from the group consisting of SEQ ID NO:l-32, and a 

pharmaceutical^ acceptable excipient In one embodiment, the composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32. The invention additionally provides a 
method of treating a disease or condition associated with decreased expression of functional TRICH, 
comprising administering to a patient in need of such treatment the composition. 
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The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from 
5 the group consisting of SEQ ID NO: 1 -32, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. Hie method comprises a) exposing a sample comprising the polypeptide to a compound, 
and b) detecting agonist activity in the sample. In one alternative, the invention provides a 

10 composition comprising an agonist compound identified by the method and a pharmaceutical^ 

acceptable excipient In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional TRICH, comprising administering to a 
patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 

15 an antagonist of a polypeptide selected from the group consisting of a) a polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide 
comprising a naturally occurring amino acid sequence at least 90% identical to an amino acid sequence 
selected from the group consisting of SEQ ID NO:l-32, c) a biologically active fragment of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and 

20 d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-32. Hie method comprises a) exposing a sample comprising the 
polypeptide to a compound, and b) detecting antagonist activity in the sample. In one alternative, the 
invention provides a composition comprising an antagonist compound identified by the method and a 
pharmaceutical^ acceptable excipient In another alternative, the invention provides a method of 

25 treating a disease or condition associated with overexpression of functional TRICH, comprising 
administering to a patient in need of such treatment the composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide selected from the group consisting of a) a polypeptide comprising an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-32, b) a polypeptide comprising a 

30 naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO:l-32, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-32. The method comprises a) combining the polypeptide with at least one test compound under 
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suitable conditions, and b) detecting binding of the polypeptide to the test compound, thereby 
identifying a compound that specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide selected from the group consisting of a) a polypeptide comprising an amino 
5 acid sequence selected from the group consisting of SEQ ED NO: 1-32, b) a polypeptide comprising a 
naturally occurring amino acid sequence at least 90% identical to an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32, c) a biologically active fragment of a polypeptide having an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, and d) an immunogenic 
fragment of a polypeptide having an amino acid sequence selected from the group consisting of SEQ ID 

10 NO: 1-32. The method comprises a) combining the polypeptide with at least one test compound under 
conditions permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide 
in the presence of the test compound, and c) comparing the activity of the polypeptide in the presence 
of the test compound with the activity of the polypeptide in the absence of the test compound, 
wherein a change in the activity of the polypeptide in the presence of the test compound is indicative 

15 of a compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:33-64, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 

20 expression of the target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
contiguous nucleotides of a polynucleotide selected from the group consisting of i) a polynucleotide 

25 comprising a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, ii) a 
polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% identical to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a 
polynucleotide having a sequence complementary to i), iv) a polynucleotide complementary to the 
polynucleotide of ii), and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions 

30 whereby a specific hybridization complex is formed between said probe and a target polynucleotide in 
the biological sample, said target polynucleotide selected from the group consisting of i) a 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:33-64, ii) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, iii) a 

35 polynucleotide complementary to the polynucleotide of i), iv) a polynucleotide complementary to the 
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polynucleotide of ii), and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide 
comprises a fragment of a polynucleotide sequence selected from the group consisting of i)-v) above; 
c) quantifying the amount of hybridization complex; and d) comparing the amount of hybridization 
complex in the treated biological sample with the amount of hybridization complex in an untreated 
5 biological sample, wherein a difference in the amount of hybridization complex in the treated 
biological sample is indicative of toxicity of the test compound. 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
10 sequences of the present invention. 

Table 2 shows the GenBank identification number and annotation of the nearest GenBank 
homolog for polypeptides of the invention. The probability score for the match between each 
polypeptide and its GenBank homolog is also shown. 

Table 3 shows structural features of polypeptide sequences of the invention, including predicted 
15 motifs and domains, along with the methods, algorithms, and searchable databases used for analysis of 
the polypeptides. 

Table 4 lists the cDNA and/or genomic DNA fragments which were used to assemble 
polynucleotide sequences of the invention, along with selected fragments of the polynucleotide 
sequences. 

20 Table 5 shows the representative cDNA library for polynucleotides of the invention. 

Table 6 provides an appendix which describes the tissues and vectors used for construction of 
the cDN A libraries shown in Table 5 . 

Table 7 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

25 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
30 particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
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reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
5 any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the invention is 
10 not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"TRICH" refers to the amino acid sequences of substantially purified TRICH obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant 

15 The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 

TRICH. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TRICH either by directly interacting with 
TRICH or by acting on components of the biological pathway in which TRICH participates. 

An "allelic variant" is an alternative form of the gene encoding TRICH. Allelic variants may 

20 result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times in 

25 a given sequence. 

"Altered" nucleic acid sequences encoding TRICH include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TRICH or a 
polypeptide with at least one functional characteristic of TRICH. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
30 the polynucleotide encoding TRICH, and improper or unexpected hybridization to allelic variants, with 
a locus other than the normal chromosomal locus for the polynucleotide sequence encoding TRICH. 
The encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of 
amino acid residues which produce a silent change and result in a functionally equivalent TRICH. 
Deliberate amino acid substitutions may be made on the basis of similarity in polarity, charge, 

18 



NSDOCID: <WO 0204520A2_L> 



WO 02/04520 



PCT/US01/21448 



solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the residues, as long as the 
biological or immunological activity of TRICH is retained. For example, negatively charged amino 
acids may include aspartic acid and glutamic acid, and positively charged amino acids may include 
lysine and arginine. Amino acids with uncharged polar side chains having similar hydrophilicity 
5 values may include: asparagine and glutamine; and serine and threonine. Amino acids with 
uncharged side chains having similar hydrophilicity values may include: leucine, isoleucine, and 
valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 

10 molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 

protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well known 

15 in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
TRICH. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TRICH either by 
directly interacting with TRICH or by acting on components of the biological pathway in which TRICH 
20 participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab') 2 * and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind TRICH polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 

25 to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 

30 makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 
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The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
5 sugar groups such as 2'-methoxyethyl sugars or 2'-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2'-deoxyuracil, or 7-deaza-2-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 

10 translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TRICH, or of any oligopeptide thereof, 

15 to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, 5'-AGT-3* pairs with its complement, 
3'-TCA-5\ 

20 A "composition comprising a given polynucleotide sequence" and a "composition comprising a 

given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding TRICH or fragments of TRICH may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 

25 associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 

deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosystems, 

30 Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 
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"Conservative amino acid substitutions" are those substitutions that are predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protein and which are regarded as 
5 conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 




Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 


10 


Asp 


Asn, Glu 




Cys 


Ala, Ser 




Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 


15 


His 


Asn, Arg, Gin, Glu 




lie 


Leu, Val 




Leu 


He, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, lie 


20 


Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 




Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 


25 


Val 


He, Leu, Thr 



Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
30 side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide can include, for example, replacement of hydrogen by an alkyl, acyl, 
35 hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains at least one 
biological or immunological function of the natural molecule. A derivative polypeptide is one modified 
by glycosylation, pegylation, or any similar process that retains at least one biological or immunological 
function of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
40 measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

"Differential expression" refers to increased or upregulated; or decreased, downregulated, or 
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absent gene or protein expression, determined by comparing at least two different samples. Such 
comparisons may be carried out between, for example, a treated and an untreated sample, or a diseased 
and a normal sample. 

A "fragment" is a unique portion of TRICH or the polynucleotide encoding TRTCH which is 
5 identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nuclebtide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5,10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 

10 residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 cm* 500 amino acids (or first 25% or 50%) of a polypeptide as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 

15 embodiments. 

A fragment of SEQ ID NO:33-64 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO: 33-64, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:33-64 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 

20 SEQ ID NO:33-64 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:33-64 and the region of SEQ ID NO:33-64 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-32 is encoded by a fragment of SEQ ID NO:33-64. A fragment 
of SEQ ID NO: 1-32 comprises a region of unique amino acid sequence that specifically identifies 

25 SEQ ID NO:l-32. For example, a fragment of SEQ ID NO:l-32 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1-32. The precise length of 
a fragment of SEQ ID NO:l-32 and the region of SEQ ID NO:l-32 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

30 A "full length'* polynucleotide sequence is one containing at least a translation initiation codon 

(e.g., methionine) followed by an open reading frame and a translation termination codon. A "full 
length" polynucleotide sequence encodes a "full length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

35 The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
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the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore, 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e sequence 
alignment program. This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

http.7/www.ncbi.nlranih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The 
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-21-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 
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Percent identity may be measured over the length of an entire defined sequence, for example, as 
defined by a particular SEQ ID number, or may be measured over a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous nucleotides. Such 
5 lengths are exemplary only, and it is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
10 a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
15 methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and_hydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 1 2e sequence alignment 
20 program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
between aligned polypeptide sequence pairs. 
25 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0. 12 
(April-21 -2000) with blastp set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 

Open Gap: 11 and Extension Gap: 1 penalties 
30 Gap x drop-off: 50 

Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
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example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
5 supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

10 The term "humanized antibody" refers to an antibody molecule in which the amino acid 

sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 

15 hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 
binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 

20 annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v) 
SDS, and about 100 ng/ml sheared, denatured salmon sperm DNA. 

25 Generally, stringency of hybridization is expressed, in part, with reference to the temperature 

undo* which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (T^) for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 

30 for nucleic acid hybridization are well known and can be found in Sambrook, J. et al. (1989) Molecular 
Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
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Alternatively, temperatures of about 65 °C, 60°C, 55 °C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 jig/ml. Organic solvent, such as 
5 formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RN A.DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 
10 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 
15 or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 
20 various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of TRICH which is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 
25 TRICH which is useful in any of the antibody production methods disclosed herein or known in the art. 

The term "microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 
30 The term "modulate" refers to a change in the activity of TRICH. For example, modulation 

may cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of TRICH. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
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synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
5 linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 

10 amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TRICH may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 

15 art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu of TRICH. 

"Probe" refers to nucleic acid sequences encoding TRICH, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 

20 labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 

short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The primer may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

25 Probes and primers as used in the present invention typically comprise at least 15 contiguous 

nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20. 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 

30 specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual , 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al. (1987) Current Protocols in Molecular 
Biology , Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
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Protocols, A Guide to Methods and Applications , Academic Press, San Diego CA. PGR primer pairs 
can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Prima" (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

5 Oligonucleotides for use as primers are selected using software known in the art for such 

purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and largo* polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 

10 primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Center, Dallas TX) is capable of choosing specific primers from megabase sequences 
and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Center for Genome Research, 
Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 

15 binding sites are user-specified. Primer 3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 
their respective sources and modified to meet the user's specific needs,) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 

20 hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 
polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray elements, or specific probes to identify fully or partially complementary 

25 polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
30 artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 
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Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
5 regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions (UTRs). 
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

"Reporter molecules" are chemical oj biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
10 chemiluminescent, or chromogenic agents; substrates; cof actors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
15 instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing TRICH, 
nucleic acids encoding TRICH, or fragments thereof may comprise a bodily fluid; an extract from a 
cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, 
in solution or bound to a substrate; a tissue; a tissue print; etc. 
20 The terms "specific binding" and "specifically binding" refer to that interaction between a 

protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
25 A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
30 they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
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microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 
5 'Transformation" describes a process by which exogenous DNA is introduced into a recipient 

cell. Transformation may occur undo: natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is selected based on the type 
of host cell being transformed and may include, but is not limited to, bacteriophage or viral infection, 

10 electroporation, heat shock, lipofection, and particle bombardment. The term "transformed cells" 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 

15 animals and plants, in which one or more of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 

20 vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 

25 into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
30 set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at 
least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence 
identity over a certain defined length. A variant may be described as, for example, an "allelic" (as 
defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
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identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRN A processing. The corresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
5 polypeptides will generally have significant amino acid identity relative to each other. A polymorphic 
variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for. example, a certain population, a disease state, or a propensity for a disease state. 

10 A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at 

15 least 95%, at least 96%, at least 97%, at least 98%, or at least 99% or greater sequence identity over a 
certain defined length of one of the polypeptides. 

THE INVENTION 

The invention is based on the discovery of new human transporters and ion channels (TRICH), 
20 the polynucleotides encoding TRICH, and the use of these compositions for the diagnosis, treatment, or 
prevention of transport, neurological, muscle, immunological, and cell proliferative disorders. 

Table 1 summarizes the nomenclature for the full length polynucleotide and polypeptide 
sequences of the invention. Each polynucleotide and its corresponding polypeptide are correlated to a 
single Incyte project identification number (Incyte Project ED). Each polypeptide sequence is denoted 
25 by both a polypeptide sequence identification number (Polypeptide SEQ ID NO:) and an Incyte 
polypeptide sequence number (Incyte Polypeptide ID) as shown. Each polynucleotide sequence is 
denoted by both a polynucleotide sequence identification number (Polynucleotide SEQ ID NO:) and an 
Incyte polynucleotide consensus sequence number (Incyte Polynucleotide ID) as shown. 

Table 2 shows sequences with homology to the polypeptides of the invention as identified by 
30 BLAST analysis against the GenBank protein (genpept) database. Columns 1 and 2 show the 

polypeptide sequence identification number (Polypeptide SEQ ID NO.) and the corresponding Incyte 
polypeptide sequence number (Incyte Polypeptide ID) for polypeptides of the invention. Column 3 
shows the GenBank identification number (Genbank ID NO:) of the nearest GenBank homolog. 
Column 4 shows the probability score for the match between each polypeptide and its GenBank 
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homolog. Column 5 shows the annotation of the GenBank homolog along with relevant citations where 
applicable, all of which are expressly incorporated by reference herein. 

Table 3 shows various structural features of the polypeptides of the invention. Columns 1 and 
2 show the polypeptide sequence identification number (SEQ ID NO:) and the corresponding Incyte 
5 polypeptide sequence number (Incyte Polypeptide ID) for each polypeptide of the invention. Column 3 
shows the number of amino acid residues in each polypeptide. Column 4 shows potential 
phosphorylation sites, and column 5 shows potential glycosylation sites, as determined by the MOTIFS 
program of the GCG sequence analysis software package (Genetics Computer Group, Madison WI). 
Column 6 shows amino acid residues comprising signature sequences, domains, and motifs. Column 7 

10 shows analytical methods for protein structure/function analysis and in some cases, searchable 
databases to which the analytical methods ware applied. 

Together, Tables 2 and 3 summarize the properties of polypeptides of the invention, and these 
properties establish that the claimed polypeptides are transporters and ion channels. For example, SEQ 
ID NO:5 is 83% identical to rat GABA receptor rho-3 subunit precursor (GenBank ID gl060975) as 

15 determined by the Basic Local Alignment Search Tool (BLAST). (See Table 2.) The BLAST 

probability score is 1 ,7e-206, which indicates the probability of obtaining the observed polypeptide 
sequence alignment by chance. SEQ ID NO:5 also contains a neurotransmitter-gated ion channel 
domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 

20 BLIMPS, MOTIFS, and PROFILESCAN analyses provide further corroborative evidence that SEQ ID 
NO:5 is a neurotransmitter-gated ion channel. In an alternate example, SEQ ID NO:16 is 57% 
identical to human Na+/glucose cotransporter (GenBank ID g338055) as determined by the Basic Local 
Alignment Search Tool (BLAST). (See Table 2.) The BLAST probability score is 2.4e-18 1 , which 
indicates the probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID 

25 NO: 16 also contains a sodiumisolute symporter family domain as determined by searching for 
statistically significant matches in the hidden Markov model (HMM)-based PFAM database of 
conserved protein family domains. (See Table 3.) Data from BLIMPS, MOTIFS, and PROFILESCAN 
analyses provide further corroborative evidence that SEQ ID NO: 16 is a Na+/glucose cotransporter. In 
an alternate example, SEQ ID NO: 27 is 53% identical to human ATP-binding cassette transporter- 1 

30 (ABC-1) (GenBaiik ID g4128033) as determined by the Basic Local Alignment Search Tool (BLAST). 
(See Table 2.) The BLAST probability score is 0.0, which indicates the probability of obtaining the 
observed polypeptide sequence alignment by chance. SEQ ID NO:27 also contains an ABC transporter 
domain as determined by searching for statistically significant matches in the hidden Markov model 
(HMM)-based PFAM database of conserved protein family domains. (See Table 3.) Data from 
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BLIMPS, MOTIFS, and PROFBLESCAN analyses provide further corroborative evidence that SEQ ID 
NO:27 is an ABC transporter. In an alternate example, SEQ ID NO:12 is 45% identical to rat thyroid 
sodium/iodide symporter NIS (GenBank ID gl 399954) as determined by the Basic Local Alignment 
Search Tool (BLAST). (See Table 2.) The BLAST probability score is 3.0e-143, which indicates the 
5 probability of obtaining the observed polypeptide sequence alignment by chance. SEQ ID NO: 1 2 also 
contains a sodiumrsolute symporter family domain as determined by searching for statistically 
significant matches in the hidden Markov model (HMM)-based PFAM database of conserved protein 
family domains. (See Table 3.) Data from BLIMPS and PROFILESCAN analyses provide further 
corroborative evidence that SEQ ID NO: 12 is a sodium:solute symporter. SEQ ID NO: 1-4, SEQ ID 
10 NO:6-l 1, SEQ ID NO:13-15, SEQ ID NO:17-26, and SEQ ID NO:28-32 were analyzed and annotated 
in a similar manner. The algorithms and parameters for the analysis of SEQ ID NO:l-32 are described 
in Table 7. 

As shown in Table 4, the fiill length polynucleotide sequences of the present invention were 
assembled using cDNA sequences or coding (exon) sequences derived from genomic DNA, or any 

15 combination of these two types of sequences. Columns 1 and 2 list the polynucleotide sequence 
identification number (Polynucleotide SEQ ID NO:) and the corresponding Incyte polynucleotide 
consensus sequence number (Incyte Polynucleotide ID) for each polynucleotide of the invention. 
Column 3 shows the length of each polynucleotide sequence in basepairs. Column 4 lists fragments of 
the polynucleotide sequences which are useful, for example, in hybridization or amplification 

20 technologies that identify SEQ ID NO:33-64 or that distinguish between SEQ ID NO:33-64 and 
related polynucleotide sequences. Column 5 shows identification numbers corresponding to cDNA 
sequences, coding sequences (exons) predicted from genomic DNA, and/or sequence assemblages 
comprised of both cDNA and genomic DNA. These sequences were used to assemble the full length 
polynucleotide sequences of the invention. Columns 6 and 7 of Table 4 show the nucleotide start (5') 

25 and stop (3') positions of the cDNA and/or genomic sequences in column 5 relative to their respective 
full length sequences. 

The identification numbers in Column 5 of Table 4 may refer specifically, for example, to 
Incyte cDNAs along with their corresponding cDNA libraries. For example, 6724643H1 is the 
identification number of an Incyte cDNA sequence, and LUNLTMT01 is the cDNA library from 
30 which it is derived. Incyte cDNAs for which cDNA libraries are not indicated were derived from 

pooled cDNA libraries (e.g., 714955 15V1). Alternatively, the identification numbers in column 5 may 
refer to GenBank cDNAs or ESTs (e.g., g5746200) which contributed to the assembly of the full length 
polynucleotide sequences. In addition, the identification numbers in column 5 may identify sequences 
derived from the ENSEMBL (The Sanger Centre, Cambridge, UK) database (i.e. , those sequences 
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including the designation sr ENST")- Alternatively, the identification numbers in column 5 may be 
derived from the NCBI RefSeq Nucleotide Sequence Records Database (i.e., those sequences including 
the designation "NM" or "NT") or the NCBI RefSeq Protein Sequence Records (i.e. t those sequences 
including the designation "NP"). Alternatively, the identification numbers in column 5 may refer to 
5 assemblages of both cDNA and Genscan-predicted exons brought together by an "exon stitching" 
algorithm For example, Y^JOOOOOC_Nj_N 2 _YYYYY_Ns_N 4 represents a "stitched" sequence in 
which XXXXXX is the identification number of the cluster of sequences to which the algorithm was 
applied, and YYYYY is the number of the prediction generated by the algorithm, and N ]f2 j„.> if present, 
represent specific exons that may have been manually edited during analysis (See Example V). 

10 Alternatively, the identification numbers in column 5 may refer to assemblages of exons brought 

together by an "exon-stretching" algorithm. For example, FLJOOOOO£__gAAAAA_gBBBBB_l_N is the 
identification number of a "stretched" sequence, with XXXXXX being the Incyte project identification 
number, gAAAAA being the GenBank identification number of the human genomic sequence to which 
the "exon-stretching" algorithm was applied, gBBBBB being the GenBank identification number or 

15 NCBI RefSeq identification number of the nearest GenBank protein homolog, and N referring to 

specific exons (See Example V). In instances where a RefSeq sequence was used as a protein homolog 
for the "exon-stretching" algorithm, a RefSeq identifier (denoted by "NM," "NP," or "NT") may be 
used in place of the GenBank identifier (i.e., gBBBBB). 

Alternatively, a prefix identifies component sequences that were hand-edited, predicted from 

20 genomic DNA sequences, or derived from a combination of sequence analysis methods. The following 
Table lists examples of component sequence prefixes and corresponding sequence analysis methods 
associated with the prefixes (see Example IV and Example V). 



Prefix 


Type of analysis and/or examples of programs 


GNN, GFG, 
ENST 


Exon prediction from genomic sequences using, for example, 
GENSCAN (Stanford University, CA, USA) or FGENES 
(Computer Genomics Group, The Sanger Centre, Cambridge, UK). 


GBI 


Hand-edited analysis of genomic sequences. 


FL 


Stitched or stretched genomic sequences (see Example V). 



In some cases, Incyte cDNA coverage redundant with the sequence coverage shown in column 
30 5 was obtained to confirm the final consensus polynucleotide sequence, but the relevant Incyte cDNA 
identification numbers are not shown. 

Table 5 shows the representative cDNA libraries for those full length polynucleotide sequences 
which were assembled using Incyte cDNA sequences. The representative cDNA library is the Incyte 
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cDNA library which is most frequently represented by the Incyte cDNA sequences which were used to 
assemble and confirm the above polynucleotide sequences. The tissues and vectors which were used to 
construct the cDNA libraries shown in Table 5 are described in Table 6. 

The invention also encompasses TRICH variants. A preferred TRICH variant is one which has 
5 at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the TRICH amino acid sequence, and which contains at least one functional or structural 
characteristic of TRICH. 

The invention also encompasses polynucleotides which encode TRICH. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 

10 the group consisting of SEQ ID NO:33-64, which encodos TRICH. The polynucleotide sequences of 
SEQ ID NO:33-64, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

The invention also encompasses a variant of a polynucleotide sequence encoding TRICH. In 

15 particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding TRICH. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:33-64 which has at 
least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 

20 sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO:33-64. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of TRICH. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
code, a multitude of polynucleotide sequences encoding TRICH, some bearing minimal similarity to the 

25 polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 

invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 
occurring TRICH, and all such variations are to be considered as being specifically disclosed. 

30 Although nucleotide sequences which encode TRICH and its variants are generally capable of 

hybridizing to the nucleotide sequence of the naturally occurring TRICH under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TRICH or 
its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
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occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TRICH and its derivatives without altering the encoded amino acid sequences 
include the production of RNA transcripts having more desirable properties, such as a greater half-life, 
5 than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TRICH and 
TRICH derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
10 mutations into a sequence encoding TRICH or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:33-64 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
15 152:507-5 11.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the invention. The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 

20 Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 

combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler 

25 (Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing 
system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, 
Sunnyvale C A), or other systems known in the art. The resulting sequences are analyzed using a 
variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short Protocols 
in Molecular Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) Molecular 

30 Biology and Biotechnology , Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding TRICH may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
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DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
5 Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
( 199 1 ) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 

10 are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 primer analysis software (National Biosciences, 

15 Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68 °C to 
72°C. 

When screening for full length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 

20 sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5* 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 

25 sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 

30 controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode TRICH may be cloned in recombinant DNA molecules that direct expression of TRICH, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of 
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the genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express TRICH. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TRICH-encoding sequences for a variety of purposes including, but 
5 not limited to, modification of the cloning, processing, and/or expression of the gene product DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create^new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

10 The nucleotides of the present invention may be subjected to DNA shuffling techniques such 

as MOLECUL ARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of TRICH, such as its biological or enzymatic activity or its ability 

15 to bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 

20 breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 

25 manner. 

In another embodiment, sequences encoding TRICH may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
TRICH itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
30 synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 

Creighton, T. (1984) Proteins, Structures and Molecular Properties . WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 431 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of TRICH, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
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from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
5 The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active TRICH, the nucleotide sequences encoding TRICH or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 

10 suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 

inducible promoters, and 5' and 3* untranslated regions in the vector and in polynucleotide sequences 
encoding TRICH. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding TRICH. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 

15 sequences encoding TRICH and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals including an in-frame ATG initiation codon should be provided by the 
vector. Exogenous translational elements and initiation codons may be of various origins, both natural 

20 and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding TRICH and appropriate transcriptional and translational control 

25 elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory 
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols in Molecular Biology , John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 

30 encoding TRICH. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
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animal cell systems. (See, e.g., Sambrook, supra ; Ausubel, supra ; Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. USA 
91:3224-3227; Sandig, V. et al. (1996) Hum. Gene TTier. 7:1937-1945; Takamatsu, N. (1987) EMBO 
J. 6:307-31 1 ; The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New 
5 York NY, pp. 191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and 
Harrington, J J. et al. (1997) Nat Genet 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. 

10 USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Venna, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding TRICH. For example, routine cloning, 

15 subcloning, and propagation of polynucleotide sequences encoding TRICH can be achieved using a 

multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 plasmid 
(Life Technologies). Ligation of sequences encoding TRICH into the vector's multiple cloning site 
disrupts the lacL gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 

20 transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 

deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of TRICH are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of TRICH may be used. For example, vectors containing the 
strong, inducible SP6 or T7 bacteriophage promoter may be used. 

25 Yeast expression systems may be used for production of TRICH. A number of vectors 

containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable integration 
of foreign sequences into the host genome for stable propagation. (See, e.g. , Ausubel, 1 995 , supra ; 

30 Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, CA. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of TRICH. Transcription of sequences encoding 
TRICH may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
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Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g„ Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) Science 
224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) These constructs can 
be introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
5 e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding TRICH may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 

10 sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses TRICH in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

15 Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 

DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, JJ. et al. (1997) Nat. Genet. 15:345-355.) 
For long term production of recombinant proteins in mammalian systems, stable expression of 

20 TRICH in cell lines is preferred. For example, sequences encoding TRICH can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before 
being switched to selective media. The purpose of the selectable marker is to confer resistance to a 

25 selective agent, and its presence allows growth and recovery of cells which successfully express the 
introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
but are not limited to, the herpes simplex virus thymidine kinase and adenine phosphoribosyltransferase 

30 genes, for use in tk" and apr cells, respectively. (See, e.g., Wigler, M. et al. (1977) Cell 1 1:223-232; 
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418 ; and als and pat confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) 
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Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), 6 
5 glucuronidase and its substrate B-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C. A. 
(1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 

10 also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding TRICH is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TRICH can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding TRICH under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 

15 expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TRICH and that express 
TRICH may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DN A-DN A or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 

20 chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 
Immunological methods for detecting and measuring the expression of TRICH using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 

25 antibodies reactive to two non-interfering epitopes on TRICH is preferred, but a competitive binding 
assay may be employed. These and other assays are well known in the art. (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual , APS Press, St. Paul MN, Sect. IV; Coligan, J.E. 
et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols , Humana Press, Totowa NJ.) 

30 A wide variety of labels and conjugation techniques are known by those skilled in the art and 

may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding TRICH include 
oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TRICH, or any fragments thereof, may be cloned into a vector for 
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the production of an mRN A probe. Such vectors are known in the art, are commercially available, and 
may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as 
T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
5 (Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for ease 
of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRICH may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 

10 produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode TRICH may be designed to contain signal sequences which direct 
secretion of TRICH through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 

15 inserted sequences or to process the expressed protein in the desired fashion. Such modifications of the 
polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different host cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 

20 (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 

Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding TRICH may be ligated to a heterologous sequence resulting in translation of a 

25 fusion protein in any of the aforementioned host systems. For example, a chimeric TRICH protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TRICH activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available affinity 
matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), maltose 

30 binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and 
hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion 
proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
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these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
located between the TRICH encoding sequence and the heterologous protein sequence, so that TRICH 
may be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra , ch. 10). A variety of commercially 
5 available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TRICH may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the T7, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 

10 example, 35 S-methionine. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
that specifically bind to TRICH. At least one and up to a plurality of test compounds may be 
screened for specific binding to TRICH. Examples of test compounds include antibodies, 
oligonucleotides, proteins (e.g., receptors), or small molecules. 

15 In one embodiment, the compound thus identified is closely related to the natural ligand of 

TRICH, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, e.g., Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TRICH 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 

20 compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express TRICH, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila , or E 
coli . Cells expressing TRICH or cell membrane fractions which contain TRICH are then contacted 
with a test compound and binding, stimulation, or inhibition of activity of either TRICH or the 

25 compound is analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TRICH, either in 
solution or affixed to a solid support, and detecting the binding of TRICH to the compound. 

30 Alternatively, the assay may detect or measure binding of a test compound in the presence of a 

labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

TRICH of the present invention or fragments thereof may be used to screen for compounds 
35 that modulate the activity of TRICH. Such compounds may include agonists, antagonists, or partial 

44 

NSDOCID: <WO 0204520A2_I_> 



WO 02/04520 



PCT/USO 1/2 1448 



or inverse agonists. In one embodiment, an assay is performed under conditions permissive for TRICH 
activity, wherein TRICH is combined with at least one test compound, and the activity of TRICH in the 
presence of a test compound is compared with the activity of TRICH in the absence of the test 
compound. A change in the activity of TRICH in the presence of the test compound is indicative of a 
5 compound that modulates the activity of TRICH. Alternatively, a test compound is combined with an 
in vitro or cell-free system comprising TRICH under conditions suitable for TRICH activity, and the 
assay is performed. In either of these assays, a test compound which modulates the activity of TRICH 
may do so indirectly and need not come in direct contact with the test compound. At least one and up to 
a plurality of test compounds may be screened. 

10 In another embodiment, polynucleotides encoding TRICH or their mammalian homologs may 

be "knocked out" in an animal model system using homologous recombination in embryonic stem (ES) 
cells. Such techniques are well known in the art and are useful for the generation of animal models of 
human disease. (See, e.g., U.S. Patent Number 5,175,383 and U.S. Patent Number 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse embryo 

15 and grown in culture. The ES cells are transformed with a vector containing the gene of interest 

disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. (1989) 
Science 244:1288-1292). The vector integrates into the corresponding region of the host genome by 
homologous recombination. Alternatively, homologous recombination takes place using the Cre-loxP 
system to knockout a gene of interest in a tissue- or developmental stage-specific manner (Marth, J.D. 

20 (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids Res. 25:4323-4330). 
Transformed ES cells are identified and microinjected into mouse cell blastocysts such as those from 
the C57BL/6 mouse strain. The blastocysts are surgically transferred to pseudopregnant dams, and the 
resulting chimeric progeny are genotyped and bred to produce heterozygous or homozygous strains. 
Transgenic animals thus generated may be tested with potential therapeutic or toxic agents. 

25 Polynucleotides encoding TRICH may also be manipulated in vitro in ES cells derived from 

human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. (1998) 
Science 282:1145-1147). 

30 Polynucleotides encoding TRICH can also be used to create "knockin" humanized animals 

(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a region 
of a polynucleotide encoding TRICH is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the blastulae 
are implanted as described above. Transgenic progeny or inbred lines are studied and treated with 
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potential pharmaceutical agents to obtain information on treatment of a human disease. Alternatively, a 
mammal inbred to overexpress TRICH, e.g., by secreting TRICH in its milk, may also serve as a 
convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

5 Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 

between regions of TRICH and transporters and ion channels. In addition, the expression of TRICH 
is closely associated with adrenal, testicular, and prostate tumors, Crohn's disease, teratocarcinoma 
and dendritic cells, brain, lung, ileum, small intestine, uterine myometrial, colon, and pancreatic 
tissues. Therefore, TRICH appears to play a role in transport, neurological, muscle, immunological, 

10 and cell proliferative disorders. In the treatment of disorders associated with increased TRICH 

expression or activity, it is desirable to decrease the expression or activity of TRICH. In the treatment 
of disorders associated with decreased TRICH expression or activity, it is desirable to increase the 
expression or activity of TRICH. 

Therefore, in one embodiment, TRICH or a fragment or derivative thereof may be 

15 administered to a subject to treat or prevent a disorder associated with decreased expression or 

activity of TRICH. Examples of such disorders include, but are not limited to, a transport disorder 
such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's 
muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, 
diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic 

20 periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia 
gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral 
neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline 
myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, 

25 ethanol myopathy, dermatomyositis, inclusion body myositis, infectious myositis, polymyositis, 

neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia, and other 
disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal 
neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 

30 stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, 
Cushing's disease, Addison's disease, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 
syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, 

35 Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other 
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extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive 
neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other 
demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural 
abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous 
5 system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 

Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 

10 nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, deimatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 
disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 

15 postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis, 
Duchenne's muscular dystrophy, Becker* s muscular dystrophy, myotonic dystrophy, central core 
disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, 
infectious myositis, polymyositis, deimatomyositis, inclusion body myositis, thyrotoxic myopathy, 

20 ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's 
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and 
myopathies including encephalopathy, epilepsy, Kearns-Sayre syndrome, lactic acidosis, myoclonic 
disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an 
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 

25 adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 

30 gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 

35 colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
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circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a 
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
5 adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 

particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 
prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TRICH or a fragment or derivative 

10 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TRICH including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified TRICH in 
conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a 
disorder associated with decreased expression or activity of TRICH including, but not limited to, those 

15 provided above. 

In still another embodiment, an agonist which modulates the activity of TRICH may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of TRICH including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TRICH may be administered to a subject to treat or 

20 prevent a disorder associated with increased expression or activity of TRICH. Examples of such 

disorders include, but are not limited to, those transport, neurological, muscle, immunological, and cell 
proliferative disorders described above. In one aspect, an antibody which specifically binds TRICH 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
pharmaceutical agent to cells or tissues which express TRICH. 

25 In an additional embodiment, a vector expressing the complement of the polynucleotide 

encoding TRICH may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TRICH including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 

30 therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 
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An antagonist of TRICH may be produced using methods which are generally known in the art. 
In particular, purified TRICH may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind TRICH. Antibodies to TRICH may 
also be generated using methods that are well known in the art. Such antibodies may include, but are 
5 not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TRICH or with any fragment or oligopeptide thereof 

10 which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

15 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 

TRICH have an amino acid sequence consisting of at least about 5 amino acids, and generally will 
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or 
fragments are identical to a portion of the amino acid sequence of the natural protein. Short stretches of 
TRICH amino acids may be fused with those of another protein, such as KLH, and antibodies to the 

20 chimeric molecule may be produced. 

Monoclonal antibodies to TRICH may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-cell hybridoma technique, and the EB V-hybridoma 
technique. (See, e.g., KoMer, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 

25 Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
Cole, S.R et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, SX. et al. (1984) Proc. 

30 Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce TRICH-specific single 
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
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generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, 
D.R. (1991) Proc. Natl. Acad Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
5 the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TRICH may also be generated. 
For example, such fragments include, but are not limited to, F(ab> 2 fragments produced by pepsin 
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the 
10 F(ab02 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. et al. 
(1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
15 polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TRICH and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive 
to two non-interfering TRICH epitopes is generally used, but a competitive binding assay may also be 
employed (Pound, supra) . 

20 Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 

may be used to assess the affinity of antibodies for TRICH. Affinity is expressed as an association 
constant, K^, which is defined as the molar concentration of TRICH-antibody complex divided by the 
molar concentrations of free antigen and free antibody under equilibrium conditions. The determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple 

25 TRICH epitopes, represents the average affinity, or avidity, of the antibodies for TRICH. The 

determined for a preparation of monoclonal antibodies, which are monospecific for a particular TRICH 
epitope, represents a true measure of affinity. High-affinity antibody preparations with IC, ranging from 
about 10 9 to 1 0 12 L/mole are preferred for use in immunoassays in which the TRICH-antibody complex 
must withstand rigorous manipulations. Low-affinity antibody preparations with ranging from 

30 about 10 6 to 10 7 L/mole are preferred for use in immunopurification and similar procedures which 
ultimately require dissociation of TRICH, preferably in active form, from the antibody (Catty, D. 
(1988) Antibodies. Volume I: A Practical Approach , IRL Press, Washington DC; Liddell, J.E. and A. 
Cryer (1991) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, New York NY). 
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The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of TRICH-antibody 
5 complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 

antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 
Coligan et al. supra .) 

In another embodiment of the invention, the polynucleotides encoding TRICH, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 

10 expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding TRICH. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding TRICH. 
(See, e.g., Agrawal, S., ed. (1996) Antisense Therapeutics , Humana Press Inc., TotawaNJ.) 

15 In therapeutic use, any gene delivery system suitable for introduction of the antisense 

sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellularly in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 

20 9(13):1288-1296.) Antisense sequences can also be introduced intracellularly through the use of viral 
vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra ; Uckert, W. and W. Walther (1994) Pharmacol. Hier. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, RJ. et 

25 al. (1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14);2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TRICH may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCED)-Xl disease characterized by X-linked 
30 inheritance (Cavazzana-Calvo, M. et al. (2000) Science 238:669-672), severe combined 

immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
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hypercholesterolemia, and hemophilia resulting from Factor Vm or Factor DC deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
cell proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
5 against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescMa, E. et al. (1996) Proc. Natl. Acad Sci. USA. 93:11395-11399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruziY In the 
case where a genetic deficiency in TRICH expression or regulation causes disease, the expression of 

10 TRICH from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic; deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TRICH are treated by constructing mammalian expression vectors encoding TRICH and introducing 
these vectors by mechanical means into TRICH-deficient cells. Mechanical transfer technologies for 

15 use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic 
gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) 
the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. Biochem. 62:191- 
217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. R6cipon (1998) Curr. Opin. Biotechnol. 
9:445-450). 

20 Expression vectors that may be effective for the expression of TRICH include, but are not 

limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TRICH may be expressed 
using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 

25 (RSV), SV40 virus, thymidine kinase (TK), or (3-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. andH.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 

30 FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and Blau, H.M. supra) ), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TRICH from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
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polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, FX. and AJ. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1 :841-845). The introduction of DNA to primary cells requires modification of these 
5 standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to TRICH expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TRICH under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 

10 element (RRE) along with additional retrovirus m-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 
appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 

15 receptors on the target ceils or a promiscuous envelope protein such as VS Vg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61:1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 

20 method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 

Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4 + T-cells), and the 
return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, U. et al. 

25 (1998) Proc. Natl. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to cells which have one or more genetic abnormalities with respect to 
the expression of TRICH. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be 

30 versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. 
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Rev. Nutr. 19:51 1-544 and Verma, LM. and N. Somia (1997) Nature 18:389:239-242, both 
incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TRICH to target cells which have one or more genetic abnormalities with 
5 respect to the expression of TRICH. The use of herpes simplex virus (HS V>based vectors may be 
especially valuable for introducing TRICH to cells of the central nervous system, for which HS V has a 
tropism. The construction and packaging of herpes-based vectors are well known to those with 
ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye Res. 

10 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,41 3 teaches the use of recombinant HSV 
d92 which consists of a genome containing at least one exogenous gene to be transferred to a cell under 
the control of the appropriate promoter for purposes including human gene therapy. Also taught by this 

15 patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. 
For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. (1994) Dev. 
Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus 
sequences, the generation of recombinant virus following the transfection of multiple plasmids 
containing different segments of the large herpesvirus genomes, the growth and propagation of 

20 herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of ordinary 
skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding TRICH to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on 

25 the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotechnol. 9:464-469). During 

alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full length genomic RNA, resulting 
in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., 
protease and polymerase). Similarly, inserting the coding sequence for TRICH into the alphavirus 

30 genome in place of the capsid-coding region results in the production of a large number of TRICH- 
coding RNAs and the synthesis of high levels of TRICH in vector transduced cells. While alphavirus 
infection is typically associated with cell lysis within a few days, the ability to establish a persistent 
infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
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(Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the 
introduction of TRICH into a variety of cell types. The specific transduction of a subset of cells in a 
population may require the sorting of cells prior to transduction. The methods of manipulating 
infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and 
5 performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 

10 transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches . Futura Publishing, Mt. KiscoNY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes, 

15 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 

RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TRICH. 

20 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 

25 candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 

30 Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
encoding TRICH. Such DNA sequences may be incorporated into a wide variety of vectors with 
suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or 
tissues. 
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RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
5 extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TRICH. 

10 Compounds which may be effective in altering expression of a specific polynucleotide may include, 
but are not limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming 
oligonucleotides, transcription factors and other polypeptide transcriptional regulators, and non- 
macromolecular chemical entities which are capable of interacting with specific polynucleotide 
sequences. Effective compounds may alter polynucleotide expression by acting as either inhibitors or 

15 promoters of polynucleotide expression. Thus, in the treatment of disorders associated with increased 
TRICH expression or activity, a compound which specifically inhibits expression of the 
polynucleotide encoding TRICH may be therapeutically useful, and in the treatment of disorders 
associated with decreased TRICH expression or activity, a compound which specifically promotes 
expression of the polynucleotide encoding TRICH may be therapeutically useful. 

20 At least one, and up to a plurality, of test compounds may be screened for effectiveness in 

altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 

25 based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding TRICH is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-firee or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding TRICH are assayed 

30 by any method commonly known in the art. Typically, the expression of a specific nucleotide is 

detected by hybridization with a probe having a nucleotide sequence complementary to the sequence 
of the polynucleotide encoding TRICH. The amount of hybridization may be quantified, thus 
forming the basis for a comparison of the expression of the polynucleotide both with and without 
exposure to one or more test compounds. Detection of a change in the expression of a polynucleotide 

35 exposed to a test compound indicates that the test compound is effective in altering the expression of 
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the polynucleotide. A screen for a compound effective in altering expression of a specific 
polynucleotide can be carried out, for example, using a Schizosaccharomvces pombe gene expression 
system (Atkins, D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, G.M. et al. (2000) Nucleic Acids 
Res. 28:E15) or aimman cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. 
5 Res. Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 
nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. 
Patent No. 6,022,691). 

10 Many methods for introducing vectors into cells or tissues are available and equally suitable for 

use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 

15 Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

An additional embodiment of the invention relates to the administration of a composition which 
20 generally comprises an active ingredient formulated with a pharmaceutical^ acceptable excipient. 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remington's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of TRICH, 
antibodies to TRICH, and mimetics, agonists, antagonists, or inhibitors of TRICH. 
25 The compositions utilized in this invention may be administered by any number of routes 

including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositions for pulmonary administration may be prepared in liquid or dry powder form. 
30 These compositions are generally aerosolized immediately prior to inhalation by the patient. In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is well-known in the art. In the case of macromolecules (e.g. larger peptides and proteins), 
recent developments in the field of pulmonary delivery via the alveolar, region of the lung have enabled 
the practical delivery of drugs such as insulin to blood circulation (see, e.g., Patton, I.S. et al., U.S. 
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Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination of 
5 an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositions may be prepared for direct intracellular delivery of 
macromolecules comprising TRICH or fragments thereof. For example, liposome preparations 
containing a cell-impermeable macromolecule may promote cell ftision and intracellular delivery of the 
macromolecule. Alternatively, TRICH or a fragment thereof may be joined to a short cationic N- 
10 terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to 

transduce into the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. et 
al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
15 or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example TRICH 
or fragments thereof, antibodies of TRICH, and agonists, antagonists or inhibitors of TRICH, which 
20 ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED 50 (the dose therapeutically effective in 50% of the population) or LD 50 (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD 5O /ED 50 ratio. Compositions which exhibit large 
25 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that includes the ED 50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 
30 The exact dosage will be determined by the practitioner, in light of factors related to the subject 

requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination(s), reaction sensitivities, and response to therapy. 
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Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending 
on the half -life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 Mg to 100,000 //g, up to a total dose of 
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
5 methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or then- 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

10 In another embodiment, antibodies which specifically bind TRICH may be used for the 

diagnosis of disorders characterized by expression of TRICH, or in assays to monitor patients being 
treated with TRICH or agonists, antagonists, or inhibitors of TRICH. Antibodies useful for diagnostic 
purposes may be prepared in the same manner as described above for therapeutics. Diagnostic assays 
for TRICH include methods which utilize the antibody and a label to detect TRICH in human body 

15 fluids or in extracts of cells or tissues. The antibodies may be used with or without modification, and 
may be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring TRICH, including ELIS As, RIAs, and FACS, are known 
in the art and provide a basis for diagnosing altered or abnormal levels of TRICH expression. Normal 

20 or standard values for TRICH expression are established by combining body fluids or cell extracts 

taken from normal mammalian subjects, for example, human subjects, with antibodies to TRICH under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TRICH expressed in 
subject, control, and disease samples from biopsied tissues are compared with the standard values. 

25 Deviation between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TRICH may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of TRICH may be correlated with 

30 disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
TRICH, and to monitor regulation of TRICH levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TRICH or closely related molecules may be used to 
identify nucleic acid sequences which encode TRICH. The specificity of the probe, whether it is made 
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from a highly specific region, e.g., the 5 5 regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding TRICH, allelic variants, or related 
sequences. 

5 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TRICH encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:33-64 or from 
genomic sequences including promoters, enhancers, and introns of the TRICH gene. 

Means for producing specific hybridization probes for DN As encoding TRICH include the 

10 cloning of polynucleotide sequences encoding TRICH or TRICH derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a variety 
of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, such as 

15 alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRICH may be used for the diagnosis of disorders 
associated with expression of TRICH. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 

20 insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis, 

normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer, cardiac disorders associated with transport, e.g., 
angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 

25 cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 
infectious myositis, polymyositis, neurological disorders associated with transport, e.g., Alzheimer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia, and other disorders associated with transport, e.g., neurofibromatosis, 

30 postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, 
cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes 
disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and 
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Fanconi disease; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 
disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
5 sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and 
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 

10 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system including Down syndrome, cerebral palsy, neuroskeletal disorders, autonomic 
nervous system disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis, 
inherited, metabolic, endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental 

15 disorders including mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), 
akathesia, amnesia, catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, 
postherpetic neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, 
and familial frontotemporal dementia; a muscle disorder such as cardiomyopathy, myocarditis, 
Duchenne's muscular dystrophy, Becker's muscular dystrophy, myotonic dystrophy, central core 

20 disease, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, 
infectious myositis, polymyositis, dermatomyositis, inclusion body myositis, thyrotoxic myopathy, 
ethanol myopathy, angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's 
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, pheochromocytoma, and 
myopathies including encephalopathy, epilepsy, Keams-Sayre syndrome, lactic acidosis, myoclonic 

25 disorder, ophthalmoplegia, and acid maltase deficiency (AMD, also known as Pompe's disease); an 
immunological disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, 
adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune 
polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, cholecystitis, contact 

30 dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, polymyositis, 

35 psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic 
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anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, ulcerative 
colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and extracorporeal 
circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and trauma; and a 
cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
5 hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, 
ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 

10 prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus. The polynucleotide 
sequences encoding TRICH may be used in Southern or northern analysis, dot blot, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-like 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered TRICH expression. 
Such qualitative or quantitative methods are well known in the art. 

15 In a particular aspect, the nucleotide sequences encoding TRICH may be useful in assays that 

detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TRICH may be labeled by standard methods and added to a fluid or tissue sample 
from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 
incubation period, the sample is washed and the signal is quantified and compared with a standard 

20 value. If the amount of signal in the patient sample is significantly altered in comparison to a control 
sample then the presence of altered levels of nucleotide sequences encoding TRICH in the sample 
indicates the presence of the, associated disorder. Such assays may also be used to evaluate the efficacy 
of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the 
treatment of an individual patient. 

25 In order to provide a basis for the diagnosis of a disorder associated with expression of TRICH, 

a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding TRICH, under conditions suitable for hybridization or amplification. 
Standard hybridization may be quantified by comparing the values obtained from normal subjects with 

30 values from an experiment in which a known amount of a substantially purified polynucleotide is used. 
Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 
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Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
5 days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 

10 preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TRICH 
may involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, 
or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding 

15 TRICH, or a fragment of a polynucleotide complementary to the polynucleotide encoding TRICH, and 
will be employed under optimized conditions for identification of a specific gene or condition. 
Oligomers may also be employed under less stringent conditions for detection or quantification of 
closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 

20 encoding TRICH may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 

substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding TRICH are used to amplify DNA using the 

25 polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 
tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the secondary 
and tertiary structures of PCR products in single-stranded form, and these differences are detectable 
using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers are 
fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such as 

30 DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico SNP 
(isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 
overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
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alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the high 
throughput MASS ARRAY system (Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of TRICH include radiolabeling or 
biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
5 standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

10 In further embodiments, oligonucleotides or longer fragments derived from any of the 

polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described below. The microarray may also be used to identify 
genetic variants, mutations, and polymorphisms. This information may be used to determine gene 

15 function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 

progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient. For example, therapeutic agents which are highly effective and 

20 display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, TRICH, fragments of TRICH, or antibodies specific for TRICH may 
be used as elements on a microarray. The microarray may be used to monitor or measure protein- 
protein interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 

25 generate a transcript image of a tissue or cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 
given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 
5,840,484, expressly incorporated by reference herein.) Thus a transcript image may be generated by 

30 hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 
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Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo , as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 

Transcript images which profile the expression of the polynucleotides of the present invention 
5 may also be used in conjunction with in vitro model systems and preclinical evaluation of 

pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
(Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and NX. Anderson (2000) 

10 Toxicol. Lett. 1 12-1 13:467-471, expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome- wide measurement of expression 
provides the highest quality signature. Even genes whose expression is not altered by any tested 

15 compounds are important as well , as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 

20 example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at http://www.niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 
25 containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 
levels in an untreated biological sample. Differences in the transcript levels between the two samples 
30 are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the proteome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 
can be subjected individually to ftirther analysis. Proteome expression patterns, or profiles, are 
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analyzed by quantifying the number of expressed proteins and their relative abundance undo: given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel electrophoresis, in which proteins from a sample are separated by 
5 isoelectric focusing in the first dimension, and then according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gel as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protein spot is 
generally proportional to the level of the protein in the sample. The optical densities of equivalently 

10 positioned protein spots from different samples, for example, from biological samples either treated or 
untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment. The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 

15 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for TRICH to quantify the 
levels of TRICH expression. In one embodiment, the antibodies are used as elements on a microarray, 
and protein expression levels are quantified by exposing the microarray to the sample and detecting the 

20 levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-1 1 1 ; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element. 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 

25 be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 

30 degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample are separated so that the amount of each protein can be quantified. The amount of each protein 
is compared to the amount of the corresponding protein in an untreated biological sample. A difference 
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in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the polypeptides of the present invention. 
In another embodiment, the toxicity of a test compound is assessed by treating a biological 
5 sample containing proteins with the test compound. Proteins from the biological sample are incubated 
with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

10 Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/25 11 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
2155; and Heller, MJ. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 

15 known and thoroughly described in DNA Microarrays: A Practical Approach , M. Schena, ed. (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding TRICH may be used 
to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 

20 preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 

25 constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J. J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, for 

30 example, Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 
data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OMIM) 
World Wide Web site. Correlation between the location of the gene encoding TRICH on a physical 
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map and a specific disorder, or a predisposition to a specific disorder, may help define the region of 
DN A associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
5 Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to a particular genomic region, e.g., ataxia-teiangiectasia to llq22-23, any sequences 
10 mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., 
Gatti, R. A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 
also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRICH, its catalytic or immunogenic fragments, or 
15 oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes 
between TRICH and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
20 having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with TRICH, or fragments thereof, 
and washed. Bound TRICH is then detected by methods well known in the art. Purified TRICH can 
also be coated directly onto plates for use in the aforementioned drug screening techniques. 
25 Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a 
solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding TRICH specifically compete with a test compound for binding TRICH. 
In this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
30 antigenic determinants with TRICH. 

In additional embodiments, the nucleotide sequences which encode TRICH may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 
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Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following embodiments are, 
therefore, to be construed as merely illustrative, and not limitative of the remainder of the disclosure 
in any way whatsoever. 

5 The disclosures of all patents, applications and publications, mentioned above and below 

including U.S. Ser. No. 60/216,547, U.S. Ser. No. 60/218,232, U.S. Sen No. 60/220,112, and U.S. 
Ser. No. 60/221,839 are expressly incorporated by reference herein., are expressly incorporated by 
reference herein. 

10 EXAMPLES 
I. Construction of cDNA Libraries 

Incyte cDNAs were derived from cDN A libraries described in the LIFESEQ GOLD database 
(Incyte Genomics, Palo Alto CA) and shown in Table 4, column 5. Some tissues were homogenized 
and lysed in guanidinium isothiocyanate, while others were homogenized and lysed in phenol or in a 

15 suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic solution of phenol 
and guanidine isothiocyanate. The resulting lysates were centrifuged over CsCl cushions or extracted 
with chloroform. RNA was precipitated from the lysates with either isopropanol or sodium acetate and 
ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 

20 purity. In some cases, RNA was treated with DNase. For most libraries, poly(A)+ RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chats worth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

25 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 

30 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
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PBLUESCREFT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), PCDNA2.1 plasmid 
(Invitrogen, Carlsbad CA), PBK-CMV plasmid (Stratagene), or pINCY (Incyte Genomics, Palo Alto 
CA), or derivatives thereof. Recombinant plasmids were transformed into competent E. coli cells 
including XLl-Blue, XLl-BlueMRF, or SOLR from Stratagene or DH5a, DH10B, or ElectroMAX 
5 DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNEAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least 
one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an AGTC 

10 Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.AX. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of 
distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 

15 high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 

cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 

20 III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 

25 MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 

using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were 
carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the ABI 

30 PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, 
supra , unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 
disclosed in Example VIII. 
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The polynucleotide sequences derived from Incyte cDNAs were validated by removing vector, 
linker, and poly(A) sequences and by masking ambiguous bases, using algorithms and programs based 
on BLAST, dynamic programming, and dinucleotide nearest neighbor analysis. The Incyte cDNA 
sequences or translations thereof were then queried against a selection of public databases such as the 
5 GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases, and BLOCKS, PRINTS, 
DOMO, PRODOM, and hidden Markov model (HMM)-based protein family databases such as PFAM. 
(HMM is a probabilistic approach which analyzes consensus primary structures of gene families. 
See, for example, Eddy, S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) The queries were 
performed using programs based on BLAST, FASTA, BLIMPS, and HMMER. The Incyte cDNA 

10 sequences were assembled to produce full length polynucleotide sequences. Alternatively, GenBank 
cDNAs, GenBank ESTs, stitched sequences, stretched sequences, or Genscan-predicted coding 
sequences (see Examples IV and V) were used to extend Incyte cDNA assemblages to lull length. 
Assembly was performed using programs based on Phred, Phrap, and Consed, and cDNA assemblages 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 

15 The full length polynucleotide sequences were translated to derive the corresponding full length 

polypeptide sequences. Alternatively, a polypeptide of the invention may begin at any of the methionine 
residues of the full length translated polypeptide. Full length polypeptide sequences were subsequently 
analyzed by querying against databases such as the GenBank protein databases (genpept), SwissProt, 
BLOCKS, PRINTS, DOMO, PRODOM, Prosite, and hidden Markov model (HMM)-based protein 

20 family databases such as PFAM. Full length polynucleotide sequences are also analyzed using 
MACDNASIS PRO software (Hitachi Software Engineering, South San Francisco CA) and 
LASER GENE software (DNASTAR). Polynucleotide and polypeptide sequence alignments are 
generated using default parameters specified by the CLUSTAL algorithm as incorporated into the 
MEGALIGN multisequence alignment program (DNASTAR), which also calculates the percent 

25 identity between aligned sequences. 

Table 7 summarizes the tools, programs, and algorithms used for the analysis and assembly of 
Incyte cDNA and full length sequences and provides applicable descriptions, references, and threshold 
parameters. The first column of Table 7 shows the tools, programs, and algorithms used, the second 
column provides brief descriptions thereof, the third column presents appropriate references, all of 

30 which are incorporated by reference herein in their entirety, and the fourth column presents, where 

applicable, the scores, probability values, and other parameters used to evaluate the strength of a match 
between two sequences (the higher the score or the lower the probability value, the greater the identity 
between two sequences). 
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The programs described above for the assembly and analysis of full length polynucleotide and 
polypeptide sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
NO:33-64. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 
amplification technologies are described in Table 4, column 4. 
5 IV. Identification and Editing of Coding Sequences from Genomic DNA 

Putative transporters and ion channels were initially identified by running the Genscan gene 
identification program against public genomic sequence databases (e.g., gbpri and gbhtg). Genscan is a 
general-purpose gene identification program which analyzes genomic DNA sequences from a variety of 
organisms (See Burge, C. and S. Karlin (1997) J. Mol. Biol. 268:78-94, and Burge, C. and S. Karlin 

10 (1998) Curr. Opin. Struct. Biol. 8:346-354). The program concatenates predicted exons to form an 
assembled cDNA sequence extending from a methionine to a stop codon. The output of Genscan js a 
FASTA database of polynucleotide and polypeptide sequences. The maximum range of sequence for 
Genscan to analyze at once was set to 30 kb. To determine which of these Genscan predicted cDNA 
sequences encode transporters and ion channels, the encoded polypeptides were analyzed by querying 

15 against PFAM models for transporters and ion channels. Potential transporters and ion channels were 
also identified by homology to Incyte cDNA sequences that had been annotated as transporters and ion 
channels. These selected Genscan-predicted sequences were then compared by BLAST analysis to the 
genpept and gbpri public databases. Where necessary, the Genscan-predicted sequences were then 
edited by comparison to the top BLAST hit from genpept to correct errors in the sequence predicted by 

20 Genscan, such as extra or omitted exons. BLAST analysis was also used to find any Incyte cDNA or 
public cDNA coverage of the Genscan-predicted sequences, thus providing evidence for transcription. 
When Incyte cDNA coverage was available, this information was used to correct or confirm the 
Genscan predicted sequence. Full length polynucleotide sequences were obtained by assembling 
Genscan-predicted coding sequences with Incyte cDNA sequences and/or public cDNA sequences using 

25 the assembly process described in Example III. Alternatively, full length polynucleotide sequences were 
derived entirely from edited or unedited Genscan-predicted coding sequences. 
V. Assembly of Genomic Sequence Data with cDNA Sequence Data 
"Stitched" Sequences 

Partial cDNA sequences were extended with exons predicted by the Genscan gene identification 
30 program described in Example IV. Partial cDNAs assembled as described in Example HI were mapped 
to genomic DNA and parsed into clusters containing related cDNAs and Genscan exon predictions from 
one or more genomic sequences. Each cluster was analyzed using an algorithm based on graph theory 
and dynamic programming to integrate cDNA and genomic information, generating possible splice 
variants that were subsequently confirmed, edited, or extended to create a full length sequence. 
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Sequence intervals in which the entire length of the interval was present on more than one sequence in 
the cluster were identified, and intervals thus identified were considered to be equivalent by transitivity. 
For example, if an interval was present on a cDNA and two genomic sequences, then all three intervals 
were considered to be equivalent. This process allows unrelated but consecutive genomic sequences to 
5 be brought together, bridged by cDNA sequence. Intervals thus identified were then "stitched" together 
by the stitching algorithm in the order that they appear along their parent sequences to generate the 
longest possible sequence, as well as sequence variants. Linkages between intervals which proceed 
along one type of parent sequence (cDNA to cDNA or genomic sequence to genomic sequence) were 
given preference over linkages which change parent type (cDNA to genomic sequence). The resultant 
10 stitched sequences were translated and compared by BLAST analysis to the genpept and gbpri public 
databases. Incorrect exons predicted by Genscan were corrected by comparison to the top BLAST hit 
from genpept. Sequences were further extended with additional cDNA sequences, or by inspection of 
genomic DNA, when necessary. 
"Stretched" Sequences 

15 Partial DNA sequences were extended to full length with an algorithm based on BLAST 

analysis. First, partial cDNAs assembled as described in Example III were queried against public 
databases such as the GenBank primate, rodent, mammalian, vertebrate, and eukaryote databases using 
the BLAST program. The nearest GenBank protein homolog was then compared by BLAST analysis 
to either Incyte cDNA sequences or GenScan exon predicted sequences described in Example IV. A 

20 chimeric protein was generated by using the resultant high-scoring segment pairs (HSPs) to map the 
translated sequences onto the GenBank protein homolog. Insertions or deletions may occur in the 
chimeric protein with respect to the original GenBank protein homolog. The GenBank protein homolog, 
the chimeric protein, or both were used as probes to search for homologous genomic sequences from the 
public human genome databases. Partial DNA sequences were therefore "stretched" or extended by the 

25 addition of homologous genomic sequences. The resultant stretched sequences were examined to 
determine whether it contained a complete gene. 

VI. Chromosomal Mapping of TRICH Encoding Polynucleotides 

The sequences which were used to assemble SEQ ID NO:33-64 were compared with 
sequences from the Incyte LEFESEQ database and public domain databases using BLAST and other 
30 implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
SEQ ID NO:33-64 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 7). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences 
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had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

Map locations are represented by ranges, or intervals, of human chromosomes. The map 
position of an interval, in centiMorgans, is measured relative to the terminus of the chromosome's p- 
5 arm. (The centiMorgan (cM) is a unit of measurement based on recombination frequencies between 
chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase (Mb) of DNA in 
humans, although this can vary widely due to hot and cold spots of recombination.) The cM 
distances are based on genetic markers mapped by Genethon which provide boundaries for radiation 
hybrid markers whose sequences were included in each of the clusters. Human genome maps and 

10 other resources available to the public, such as the NCBI ,, GeneMap , 99" World Wide Web site 
(http.//www .ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 
VII. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

15 and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel (1995) 
supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LEFESEQ (Incyte Genomics). This analysis is 
20 much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer, 
search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 

25 5 x minimum {length(Seq. 1), length(Seq. 2)} 

The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
30 by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 

assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 
every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment. For 
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example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 100% identity and 
70% overlap at one end, or by 88% identity and 100% overlap at the other. A product score of 50 is 
produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 
5 Alternatively, polynucleotide sequences encoding TRICH are analyzed with respect to the 

tissue sources from which they were derived. For example, some full length sequences are assembled, 
at least in part, with overlapping Incyte cDNA sequences (see Example III). Each cDNA sequence is 
derived from a cDNA library constructed from a human tissue. Each human tissue is classified into one 
of the following organ/tissue categories: cardiovascular system; connective tissue; digestive system; 

10 embryonic structures; endocrine system; exocrine glands; genitalia, female; genitalia, male; germ cells; 
hemic and immune system; liver; musculoskeletal system; nervous system; pancreas; respiratory 
system; sense organs; skin; stomatognathic system; unclassified/mixed; or urinary tract. The number of 
libraries in each category is counted and divided by the total number of libraries across all categories. 
Similarly, each human tissue is classified into one of the following disease/condition categories: cancer, 

15 cell line, developmental, inflammation, neurological, trauma, cardiovascular, pooled, and other, and the 
number of libraries in each category is counted and divided by the total number of libraries across all 
categories. The resulting percentages reflect the tissue- and disease-specific expression of cDNA 
encoding TRICH. cDNA sequences and cDNA library/tissue information are found in the LIFESEQ 
GOLD database (Incyte Genomics, Palo Alto CA). 

20 VIII. Extension of TRICH Encoding Polynucleotides 

Full length polynucleotide sequences were also produced by extension of an appropriate 
fragment of the full length molecule using oligonucleotide primers designed from this fragment. One 
primer was synthesized to initiate 5' extension of the known fragment, and the other primer was 
synthesized to initiate 3* extension of the known fragment. The initial primers were designed using 

25 OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target sequence 
at temperatures of about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one extension 

30 was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , (NH 4 ) 2 S0 4 , 
and 2-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
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(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2: 
5 94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 °C, 5 min; Step 7: storage at 4°C. 

The concentration of DNA in each well was determined by dispensing 100 \il PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 |il of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 

10 Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsysteihs Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 fA to 10 pd aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose gel to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 

15 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 

20 Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfected into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 °C in 384-well plates in LB/2x 
carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
25 Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following parameters: Step 1 : 

94 °C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; Step 5: steps 2, 3, and 4 
repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1:2, 
30 v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYENAMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
sequencing ready reaction kit (Applied Biosystems). 
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In like manner, full length polynucleotide sequences are verified using the above procedure or 
are used to obtain 5' regulatory sequences using the above procedure along with oligonucleotides 
designed for such extension, and an appropriate genomic library. 

IX. Labeling and Use of Individual Hybridization Probes 

5 Hybridization probes derived from SEQ ID NO:33-64 are employed to screen cDNAs, genomic 

DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 yuCi of [y- 32 P] adenosine 

10 triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPHADEX G-25 superfine size 
exclusion dextran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl n, Eco RI, Pst I, Xba I, or 

15 Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40 °C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 

20 Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

X. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra .), mechanical 

25 microspotting technologies, and derivatives thereof. The substrate in each of the aforementioned 

technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 

30 produced using available methods and machines well known to those of ordinary skill in the art and may 
contain any appropriate number of elements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 
16:27-31.) 
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Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
5 biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
10 the microarray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly (A) + RNA sample is 

15 reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/pl oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/pl RNase inhibitor, 500 pM dATP, 500 pM dGTP, 500 pM dTTP, 40 pM 
dCTP, 40 pM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMB RIGHT kits (Incyte). Specific control poly(A) + RNAs are synthesized by in vitro transcription 

20 from non-coding yeast genomic DNA. After incubation at 37° C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85° C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 

25 using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 pi 5X SSC/0.2% SDS. 
Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 

30 amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1 -2 ng to a final quantity greater than 5 
pg. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 
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Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
5 coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 
concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 

10 apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60° C followed by washes in 

15 0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of sample mixture consisting of 0.2 jug each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65° C for 5 minutes and is aliquoted onto the microarray surface and covered with 

20 an 1 .8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 jul of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60° C. The arrays are washed for 10 min at 45° C in a first wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45°C in a second wash buffer (0.1X SSC), and dried. 

25 Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 

30 containing the array is placed on a computer-controlled X- Y stage on the microscope and raster- 
scanned past the objective. The 1.8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 

35 Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
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filters positioned between the array and the photomultiplier tubes are used to filter the signals. Hie 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 
5 The sensitivity of the scans is typically calibrated using the signal intensity generated by a 

cDN A control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1 : 100, 000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 

10 fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 

15 computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue 0ow signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

20 A grid is superimposed over the fluorescence signal image such that the signal from each spot 

is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
XL Complementary Polynucleotides 

25 Sequences complementary to the TRICH-encoding sequences, or any parts thereof, are used to 

detect, decrease, or inhibit expression of naturally occurring TRICH. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the co6ing sequence of TRICH. To inhibit transcription, a 

30 complementary oligonucleotide is designed from the most unique 5 ' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the TRICH-encoding transcript. 
XII. Expression of TRICH 



80 



NSDOCID: <WO, 0204520A2J_> 



WO 02/04520 



PCT/US01/21448 



Expression and purification of TRICH is achieved using bacterial or virus-based expression 
systems. For expression of TRICH in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDN A 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
5 promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TRICH upon induction with isppropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TRICH in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 

10 (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding TRICH by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 

15 Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. et 
al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. GeneTher. 
7:1937-1945.) 

In most expression systems, TRICH is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

20 affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosoma japonicum , enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolytically cleaved from TRICH at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 

25 using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chelate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 16). Purified TRICH obtained by these methods can be used directly in the assays shown in 
Examples XVI, XVII, and XVm where applicable. 

30 XIII. Functional Assays 

TRICH function is assessed by expressing the sequences encoding TRICH at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include PCMV SPORT (Life Technologies) and PCR3.1 (Invitrogen, Carlsbad CA), both of which 
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contain the cytomegalovirus promoter. 5-10 peg of recombinant vector are transiently transfected into a 
human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 /zg of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
5 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 

10 fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DN A content as measured by staining of DN A with propidium iodide; changes in 
cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific antibodies; 

15 and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 
(1994) Flow Cytometry , Oxford, New York NY. 

The influence of TRICH on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding TRICH and either CD64 or CD64-GFP. CD64 and 

20 CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRN A can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRNA encoding TRICH and other genes of interest can be analyzed by northern analysis or 

25 microarr ay techniques . 

XTV. Production of TRICH Specific Antibodies 

TRICH substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:48S-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

30 Alternatively, the TRICH amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 
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Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- 
Aldrich, St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccininiide ester (MBS) to 
increase immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide 
and anti-TRICH activity by, for example, binding the peptide or TRICH to a substrate, blocking with 
1 % BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XV. Purification of Naturally Occurring TRICH Using Specific Antibodies 
Naturally occurring or recombinant TRICH is substantially purified by immunoaffinity 

chromatography using antibodies specific for TRICH. An immunoaffinity column is constructed by 
covalently coupling anti-TRICH antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Aniersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing TRICH are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of TRICH (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/TRICH binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and TRICH is collected. 

XVI, Identification of Molecules Which Interact with TRICH 

Molecules which interact with TRICH may include transporter substrates, agonists or 
antagonists, modulatory proteins such as Gfiy proteins (Reimann, supra) or proteins involved in TRICH 
localization or clustering such as MAGUKs (Craven, supra ). TRICH, or biologically active fragments 
thereof, are labeled with 125 I Bolton-Hunter reagent. (See, e.g., Bolton A.E. and W.M. Hunter (1973) 
Biochem. J. 133:529-539.) Candidate molecules previously arrayed in the wells of a multi-well plate 
are incubated with the labeled TRICH, washed, and any wells with labeled TRICH complex are 
assayed. Data obtained using different concentrations of TRICH are used to calculate values for the 
number, affinity, and association of TRICH with the candidate molecules. 

Alternatively, proteins that interact with TRICH are isolated using the yeast 2-hybrid system 
(Fields, S. and O. Song (1989) Nature 340:245-246). TRICH, or fragments thereof, are expressed as 
fusion proteins with the DNA binding domain of Gal4 or lexA, and potential interacting proteins are 
expressed as fusion proteins with an activation domain. Interactions between the TRICH fusion protein 
and the TRICH interacting proteins (fusion proteins with an activation domain) reconstitute a 
transactivation function that is observed by expression of a reporter gene. Yeast 2-hybrid systems are 
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commercially available, and methods for use of the yeast 2-hybrid system with ion channel proteins are 
discussed in Niethammer, M. and M. Sheng (1998, Meth. Enzymol. 293:104-122). 

TRICH may also be used in the PATHCALLING process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
5 between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

Potential TRICH agonists or antagonists may be tested for activation or inhibition of TRICH 
ion channel activity using the assays described in section XVIII. 
XVII. Demonstration of TRICH Activity 

10 Ion channel activity of TRICH is demonstrated using an electrophysiological assay for ion 

conductance. TRICH can be expressed by transforming a mammalian cell line such as COS7, HeLa 
or CHO with a eukaryotic expression vector encoding TRICH. Eukaryotic expression vectors are 
commercially available, and the techniques to introduce them into cells are well known to those 
skilled in the art A second plasmid which expresses any one of a number of marker genes, such as B- 

15 galactosidase, is co-transformed into the cells to allow rapid identification of those cells which have 
taken up and expressed the foreign DNA. The cells are incubated for 48-72 hours after transformation 
under conditions appropriate for the cell line to allow expression and accumulation of TRICH and 6- 
galactosidase. 

Transformed cells expressing fi-galactosidase are stained blue when a suitable colorimetric 

20 substrate is added to the culture media under conditions that are well known in the art. Stained cells 
are tested for differences in membrane conductance by electrophysiological techniques that are well 
known in the art. Untransformed cells, and/or cells transformed with either vector sequences alone or 
B-galactosidase sequences alone, are used as controls and tested in parallel. Cells expressing TRICH 
will have higher anion or cation conductance relative to control cells. The contribution of TRICH to 

25 conductance can be confirmed by incubating the cells using antibodies specific for TRICH. The 

antibodies will bind to the extracellular side of TRICH, thereby blocking the pore in the ion channel, 
and the associated conductance. 

Alternatively, ion channel activity of TRICH is measured as current flow across a TRICH- 
containing Xenopus laevis oocyte membrane using the two-electrode voltage-clamp technique (Ishi et 

30 aL, supra ; Jegla, T. and L. Salkoff (1997) J. Neurosci. 17:32-44). TRICH is subcloned into an 

appropriate Xenopus oocyte expression vector, such as pBF, and 0.5-5 ng of mRNA is injected into 
mature stage IV oocytes. Injected oocytes are incubated at 18°C for 1-5 days. Inside-out 
macropatches are excised into an intracellular solution containing 116 mM K-gluconate, 4 mM KC1, 
and 10 mM Hepes (pH 7.2). The intracellular solution is supplemented with varying concentrations 

35 of the TRICH mediator, such as cAMP, cGMP, or Ca +2 (in the form of CaCl^, where appropriate. 
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Electrode resistance is set at 2-5 MQ and electrodes are filled with the intracellular solution lacking 
mediator. Experiments are performed at room temperature from a holding potential of 0 mV. Voltage 
ramps (2.5 s) from -100 to 100 mV are acquired at a sampling frequency of 500 Hz. Current measured 
is proportional to the activity of TRICH in the assay. 
5 In particular, the activities of TRICH-1, TRICH-2, and TRICH-10, are measured as K + 

conductance, the activities of TRICH-6 and TRICH-9 are measured as K + conductance in the presence 
of membrane stretch or free fatty acids, the activities of TRICH-18, TRICH-25 and TRICH-31 are 
measured as voltage-gated K + conductance, TRICH-5 activity is measured as CI" conductance in the 
presence of GAB A, TRICH- 1 1 activity is measured as cation conductance in the presence of heat, and 

10 the activity of TRICH-9, TRICH-28 is measured as Ca 2+ conductance. 

Transport activity of TRICH is assayed by measuring uptake of labeled substrates into 
Xenopus laevis oocytes. Oocytes at stages V and VI are injected with TRICH mRNA (10 ng per 
oocyte) and incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KC1, ImM CaCl 2 , 
ImM MgCl 2 , ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50fig/ml gentamycin, pH 7.8) to allow 

15 expression of TRICH. Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM 
KC1, ImM CaCl 2 , ImM MgCl 2 , 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., 
amino acids, sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. 
radiolabeled with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 
minutes, uptake is terminated by washing the oocytes three times in Na + -free medium, measuring the 

20 incorporated label, and comparing with controls. TRICH activity is proportional to the level of 
internalized labeled substrate. In particular, test substrates include pigment precursors and related 
molecules for TRICH-3, aminophospholipids for TRICH-4, fructose and glucose for TRICH-7 and 
TRICH-15, amino acids for TRICH-8, Na + and iodide for TRICH- 12, Na + and H + for TRICH- 13 and 
TRICH-21, Na + and glucose for TRICH-16 and TRICH-19, and glucose for TRICH-23, TRICH-26, 

25 TRICH-29, TRICH-30, and TRICH-32. 

ATPase activity associated with TRICH can be measured by hydrolysis of radiolabeled ATP- 
[y- 32 P], separation of the hydrolysis products by chromatographic methods, and quantitation of the 
recovered 32 P using a scintillation counter. The reaction mixture contains ATP-[y- 32 P] and varying 
amounts of TRICH in a suitable buffer incubated at 37 °C for a suitable period of time. The reaction is 

30 terminated by acid precipitation with trichloroacetic acid and then neutralized with base, and an aliquot 
of the reaction mixture is subjected to membrane or filter paper-based chromatography to separate the 
reaction products. The amount of 32 P liberated is counted in a scintillation counter. The amount of 
radioactivity recovered is proportional to the ATPase activity of TRICH in the assay, 
XVIII. Identification of TRICH Agonists and Antagonists 
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TRICH is expressed in a eukaryotic cell line such as CHO (Chinese Hamster Ovary) or HEK 
(Human Embryonic Kidney) 293. Ion channel activity of the transformed cells is measured in the 
presence and absence of candidate agonists or antagonists. Ion channel activity is assayed using patch 
clamp methods well known in the art or as described in Example XVII. Alternatively, ion channel 
5 activity is assayed using fluorescent techniques that measure ion flux across the cell membrane 
(Velicelebi, G. et al. (1999) Meth. Enzymol. 294:20-47; West, M.R. and C.R. Molloy (1996) Anal. 
Biochem. 241 :5 1 -58). These assays may be adapted for high-throughput screening using microplates. 
Changes in internal ion concentration are measured using fluorescent dyes such as the Ca 2+ indicator 
Fluo-4 AM, sodium-sensitive dyes such as SBFI and sodium green, or the CI" indicator MQAE (all 

10 available from Molecular Probes) in combination with the FLIPR fluorimetric plate reading system 

(Molecular Devices). In a more generic version of this assay, changes in membrane potential caused by 
ionic flux across the plasma membrane are measured using oxonyl dyes such as DiB AC 4 (Molecular 
Probes), DiB AC 4 equilibrates between the extracellular solution and cellular sites according to the 
cellular membrane potential. The dye's fluorescence intensity is 20-fold greater when bound to 

15 hydrophobic intracellular sites, allowing detection of DiBAC 4 entry into the cell (Gonzalez, JJE. and 
P.A. Negulescu (1998) Curr. Opin. Biotechnol. 9:624-631). Candidate agonists or antagonists may be 
selected from known ion channel agonists or antagonists, peptide libraries, or combinatorial chemical 
libraries. 

20 Various modifications and variations of the described methods and systems of the invention will 

be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 

25 to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1 . An isolated polypeptide selected from the group consisting of: 

a) a polypeptide comprising an amino acid sequence selected from the group consisting of 
5 SEQIDNO:l-32 ? 

b) a polypeptide comprising a naturally occurring amino acid sequence at least 90% identical 
to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32, 

c) a biologically active fragment of a polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32, and 

10 d) an immunogenic fragment of a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO: 1-32. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

32. 

15 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

20 5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 

NO:33-64. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

25 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

30 9. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

35 b) recovering the polypeptide so expressed. 

13"l 



<WO 0204520A2_I_> 



WO 02/04520 



PCT/US01/21448 



10. An isolated antibody which specifically binds to a polypeptide of claim 1. 

1 1 . An isolated polynucleotide selected from the group consisting of: 

a) a polynucleotide comprising a polynucleotide sequence selected from the group consisting 
5 of SEQ ED NO:33-64, 

b) a polynucleotide comprising a naturally occurring polynucleotide sequence at least 90% 
identical to a polynucleotide sequence selected from the group consisting of SEQ ID NO:33-64, 

c) a polynucleotide complementary to a polynucleotide of a), 

d) a polynucleotide complementary to a polynucleotide of b), and 
10 e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 11. 

15 13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 

having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 

20 complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 

25 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

30 b) detecting the presence or absence of said amplified target polynucleotide or fragment 

thereof, and, optionally, if present, the amount thereof. 

16. A composition comprising a polypeptide of claim 1 and a pharmaceutically acceptable 
excipient. 

35 
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17. A composition of claim 16, wherein the polypeptide has an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-32. 

18. A method for treating a disease or condition associated with decreased expression of 

5 functional TRICH, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

19. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1, the method comprising: 

10 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutically acceptable excipient. 

15 

21. A method for treating a disease or condition associated with decreased expression of 
functional TRICH, comprising administering to a patient in need of such treatment a composition of 
claim 20. 

20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A composition comprising an antagonist compound identified by a method of claim 22 

and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
TRICH, comprising administering to a patient in need of such treatment a composition of claim 23. 

30 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 
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b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
5 claim 1, said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1, 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
10 with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change in 

the activity of ihe polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
15 polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, undo: conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

20 c) comparing the expression of the target polynucleotide in the presence of varying amounts of 

the compound and in the absence of the compound. 

28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

25 b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof; 

30 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
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29. A diagnostic test for a condition or disease associated with the expression of TRICH in a 
biological sample comprising the steps of: 

a) combining the biological sample with an antibody of claim 10, under conditions suitable 
5 for the antibody to bind the polypeptide and form an antibody polypeptide complex; and 

b) detecting the complex, wherein the presence of the complex correlates with the presence 
of the polypeptide in the biological sample. 

30. The antibody of claim 10, wherein the antibody is: 
10 a) a chimeric antibody, 

b) a single chain antibody, 

c) a Fab fragment, 

d) a F(ab') 2 fragment, or 

e) a humanized antibody. 

15 

31. A composition comprising an antibody of claim 10 and an acceptable excipient 

32. A method of diagnosing a condition or disease associated with the expression of TRICH 
in a subject, comprising administering to said subject an effective amount of the composition of claim 

20 31. 

33. A composition of claim 31, wherein the antibody is labeled. 

34. A method of diagnosing a condition or disease associated with the expression of TRICH 
25 in a subject, comprising administering to said subject an effective amount of the composition of claim 

33. 

35. A method of preparing a polyclonal antibody with the specificity of the antibody of claim 
10 comprising: 

30 a) immunizing an animal with a polypeptide having an amino acid sequence selected from 

the group consisting of SEQ ID NO: 1-32, or an immunogenic fragment thereof, under conditions to 
elicit an antibody response; 

b) isolating antibodies from said animal; and 
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c) screening the isolated antibodies with the polypeptide, thereby identifying a polyclonal 
antibody which binds specifically to a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-32. 

5 36. An antibody produced by a method of claim 35. 

37. A composition comprising the antibody of claim 36 and a suitable carrier. 

38. A method of making a monoclonal antibody with the specificity of the antibody of claim 
10 10 comprising: 

a) immunizing an animal with a polypeptide having an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-32, or an immunogenic fragment thereof, under conditions to 
elicit an antibody response; 

b) isolating antibody producing cells from the animal; 

15 c) fusing the antibody producing cells with immortalized cells to form monoclonal antibody- 

producing hybridoma cells; 

d) culturing the hybridoma cells; and 

e) isolating from the culture monoclonal antibody which binds specifically to a polypeptide 
having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

20 

39. A monoclonal antibody produced by a method of claim 38. 

40. A composition comprising the antibody of claim 39 and a suitable carrier. 

25 41 . The antibody of claim 10, wherein the antibody is produced by screening a Fab 

expression library. 

42. The antibody of claim 10, wherein the antibody is produced by screening a recombinant 
immunoglobulin library. 

30 

43. A method for detecting a polypeptide having an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-32 in a sample, comprising the steps of: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide; and 
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b) detecting specific binding, wherein specific binding indicates the presence of a 
polypeptide having an amino acid sequence selected from the group consisting of SEQ ID NO: 1-32 in 
the sample. 

5 44. A method of purifying a polypeptide having an amino acid sequence selected from the 

group consisting of SEQ ID NO: 1-32 from a sample, the method comprising: 

a) incubating the antibody of claim 10 with a sample under conditions to allow specific 
binding of the antibody and the polypeptide; and 

b) separating the antibody from the sample and obtaining the purified polypeptide having an 
10 amino acid sequence selected from the group consisting of SEQ ID NO: 1-32. 

45. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l. 

46. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:2. 

15 

47. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:3. 

48. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:4. 
20 49. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:5. 

50. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:6. 

51. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:7. 

25 

52. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 8. 

53. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:9. 
30 54. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 10. 

55. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:l 1. 

56. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 12. 
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57. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 13. 

58. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 14. 

59. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 15. 

60. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 16. 

61. A polypeptide of claim 1 , comprising the amino acid sequence of SEQ ID NO: 1 7. 

62. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 18. 

63. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 19. 

64. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:20. 

65. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:21. 

66. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:22. 

* 

67. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:23. 

68. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:24. 

69. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:25. 

70. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:26. 

71. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:27. 

72. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:28. 

73. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:29. 
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74. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO: 30. 

75. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:31. 
5 76. A polypeptide of claim 1, comprising the amino acid sequence of SEQ ID NO:32. 

77, A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:33. 

10 78. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:34. 

79. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:35. 

15 

80. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:36. 

81 . A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

20 NO:37. 

82. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:38. 

25 83. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:39. 

84. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:40. 

30 

85. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:41. 
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86. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 



NO:42. 



87. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 



5 NO:43. 



88. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 



NO:44. 



10 89. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:45. 

90. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 



NO:46. 



15 



91. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 



NO:47. 



92. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 



20 NO:48. 



93. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 



NO:49. 



25 94. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:50. 

95. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 



NO:51. 



30 



96. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 



NO:52. 
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97. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:53. 

98. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

5 NO:54. 

99. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:55. 

10 100. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO:56. 

101 . A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:57. 

15 

102. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:58. 

1 03 . A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

20 NO:59. 

104. A polynucleotide of claim 11, comprising the polynucleotide sequence of SEQ ID 

NO:60. 

25 105. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:61. 

106. A polynucleotide of claim 1 1, comprising the polynucleotide sequence of SEQ ID 

NO: 62. 

30 

107. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:63. 
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108. A polynucleotide of claim 1 1 , comprising the polynucleotide sequence of SEQ ID 

NO:64. 
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Lys 


Asp 




95 




Gin 


Val 


Asn 




110 




Gin 


Gin 


Glu 




125 




Leu 


Glu 


Asn 




140 




Ser 


Ser 


Glu 




155 




Asp 


Gly 


GlU 




170 




Ser 


Glu 


Leu 




185 




Val 


He 


Cys 




200 




Thr 


Leu 


Tyr 




215 




Met 


Leu 


Leu 




23 0 




Gly 


Leu 


Val 




245 




Ser 


Gly 


Arg 




260 




Asn 


Thr 


Leu 




275 




Val 


He 


Leu 




290 




Met 


Arg 


Phe 




305 




Ala 


Phe 


Phe 




320 




Leu 


Asn 


Thr 




335 




He 


Arg 


Leu 




350 




Phe 


Cys 


Met 




365 




Leu 


Asn 


Glu 




380 




Thr 


Gly 


Thr 




395 




He 


Asn 


Gly 




410 




Lys 


Ala 


Glu 




425 




Asn 


Pro 


Leu 




440 




Leu 


Glu 


Ala 




455 




Arg 


Leu 


Leu 




470 




Glu 


Gly 


Glu 




485 




Leu 


Val 


Thr 




500 




Thr 


Pro 


Lys 




515 




Tyr 


Gin 


Leu 




530 




Met 


Ser 


Val 




545 




Cys 


Lys 


Gly 




560 





He Val Pro 
Ala Thr Asp 
Asn Arg Gin 
Gin Trp Met 
Asn Gin Phe 
Pro His Gly 
Thr Asn Met 
Gly Asp He 
Glu Pro Pro 
Trp Lys Glu 
Arg Gly Cys 
He Phe Ala 
Thr Lys Phe 
Val Leu Trp 
Ala He Gly 
Gin Val Tyr 
Ser Gly Phe 
Val Val Pro 
Gly His Ser 
Lys Lys Arg 
Glu Leu Gly 
Leu Thr Gin 
His Ser Tyr 
Leu Gly Glu 
Ala Asp Lys 
Val Lys He 
Ser Leu Cys 
Leu Tyr Tyr 
Ala Ala Arg 
Thr He Thr 
Leu Ala He 
He Val Arg 
Ala Asp Thr 



70 






Leu 


Val 


Leu 


85 






Asp 


Tyr 


Phe 


100 






Ser 


Gin 


Val 


115 






Asn 


Val 


Cys 


130 






Val 


Ala 


Ala 


145 






Leu 


Cys 


Tyr 


160 






Lys 


Val 


Arg 


175 






Ser 


Lys 


Leu 


190 






Asn 


Asn 


Lys 


205 






Asn 


Lys 


Phe 


220 






Val 


Leu 


Arg 


235 






Gly 


Pro 


Asp 


250 






Lys 


Arg 


Thr 


265 






He 


Phe 


Gly 


280 






Asn 


Ala 


He 


295 






Leu 


Pro 


Trp 


310 






Leu 


Ser 


Phe 


325 






He 


Ser 


Leu 


340 






Tyr 


Phe 


He 


355 






Thr 


Pro 


Ala 


370 






Gin 


Val 


Glu 


385 






Asn 


He 


Met 


400 






Gly Asp Val 


415 






Arg 


Pro 


Glu 


430 






Lys 


Phe 


Leu 


445 






Gly Asp 


Pro 


460 






His 


Thr 


Val 


475 






Lys 


Ala 


Gin 


490 






Asn 


Phe Gly 


505 






Val 


His 


Glu 


520 






Leu 


Asp 


Phe 


535 






Asn 


Pro 


GlU 


550 






He 


Leu 


Leu 


565 







5/87 







75 


Val 


Leu 


Thr 
90 


Arg 


His 


Lys 
105 


Leu 


He 


Asn 
120 


Val 


Gly 


Asp 
135 


Asp 


Leu 


Leu 
150 


He 


Glu 


Thr 
165 


Gin 


Ala 


He 
180 


Ala 


Lys 


Phe 
195 


Leu 


Asp 


Lys 
210 


Pro 


Leu 


Ser 
225 


Asn 


Thr 


Glu 
240 


Thr 


Lys 


Leu 
255 


Ser 


He 


Asp 
270 


Phe 


Leu 


Val 
285 


Trp 


Glu 


His 
300 


Asp 


Glu 


Ala 
315 


Trp 


Ser 


Tyr 
330 


Tyr 


Val 


Ser 
345 


Asn 


Trp 


Asp 
360 


Glu 


Ala 


Arg 
375 


Tyr 


He 


Phe 
390 


Val 


Phe 


Asn 
405 


Phe 


Asp 


Val 
420 


Pro 


Val 


Asp 
435 


Phe 


Trp 


Asp 
450 


His 


Thr 


His 
465 


Met 


Ser 


Glu 
480 


Ser 


Pro 


Asp 
495 


Phe 


Val 


Phe 
510 


Met 


Gly 


Thr 
525 


Asn 


Asn 


He 
540 


Gly 


Lys 


He 
555 


Asp 


Arg 


Leu 
570 
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His 


His 


Ser 


Thr 


Gin 


Glu 


Leu 


Leu 


Asn 


Thr 


Thr 


Met 


Asp 


His 


Leu 










575 










580 










585 


Asn 


Glu 


Tyr 


Ala 


Gly 


GlU 


Gly Leu Arg 


Thr 


Leu 


Val 


Leu 


Ala 


Tyr 








590 










595 










600 


Lys 


Asp 


Leu Asp 


Glu 


Glu Tyr Tyr Glu Glu Trp Ala Glu Arg Arg 








605 










610 










615 


Leu 


Gin 


Ala 


Ser 


Leu 


Ala 


Gin 


Asp 


Ser 


Arg 


Glu 


Asp 


Arg 


Leu 


Ala 










620 










625 










630 


ser 


lie 


Tvr 


Glu 


Glu 


Val 


Glu 


Asn 


Asn 


Met 


Met 


Leu 


Leu Gly Ala 








635 










640 










645 


Thr 


Ala 


He 


Glu 


Asp 


Lys 


Leu 


Gin 


Gin 


Gly Val 


Pro 


Glu 


Thr 


He 










650 








65'5 










660 


Ala 


Leu 


Leu 


Thr 


Leu 


Ala 


Asn 


He 


Lys 


He 


Trp 


Val 


Leu 


Thr 


Gly 










665 










670 










675 


Asp 


Lys 


Gin 


Glu 


Thr 


Ala 


Val 


Asn 


He 


Gly 


Tyr 


Ser 


Cys 


Lys 


Met 






680 










685 










690 


Leu 


Thr 


Asp 


Asp 


Met 


Thr 


Glu 


Val 


Phe 


He 


Val 


Thr 


Gly 


His 


Thr 






695 










700 










705 


Val 


Leu 


Glu 


Val 


Arg 


Glu 


Glu 


Leu 


Arg 


Lys 


Ala 


Arg 


Glu 


Lys 


Met 










710 










715 










720 


Met 


Asp 


Ser 


Ser 


Arg 


Ser 


Val 


Gly 


Asn 


Gly 


Phe 


Thr 


Tyr 


Gin 


Asp 








725 










730 










735 


Lys 


Leu 


Ser 


Ser 


Ser 


Lys 


Leu 


Thr 


Ser 


Val 


Leu 


Glu 


Ala 


Val 


Ala 








740 










745 










750 


Gly Glu 


Tyr 


Ala 


Leu 


Val 


He 


Asn 


Gly 


His 


Ser 


Leu 


Ala 


His 


Ala 








755 










760 










765 


Leu 


Glu Ala Asp 


Met 


Glu 


Leu 


Glu 


Phe 


Leu 


Glu 


Thr 


Ala 


Cys 


Ala 










770 










775 










780 


Cys 


Lys 


Ala 


Val 


He 


Cys ,Cys 


Arg 


Val 


Thr 


Pro 


Leu 


Gin 


Lys 


Ala 






785 










790 










795 


Gin 


Val 


Val 


Glu 


Leu 


Val 


Lys 


Lys 


Tyr 


Lys 


Lys 


Ala 


Val 


Thr 


Leu 










800 








8 05 










810 


Ala 


lie 


Gly Asp 


Gly 


Ala 


Asn Asp 


Val 


Ser 


Met 


He 


Lys 


Thr 


Ala 










815 










820 










825 


His 


He 


Gly Val 


Gly 


He 


Ser Gly Gin Glu Gly 


He 


Gin 


Ala 


Val 










830 










835 










840 


Leu 


Ala 


Ser 


Asp 


Tyr 


Ser 


Phe 


Ser 


Gin 


Phe 


Lys 


Phe 


Leu 


Gin 


Arg 








845 










850 










855 


Leu 


Leu 


Leu 


Val 


His 


Gly Arg 


Trp 


Ser 


Tyr 


Leu Arg Met 


Cys 


Lys 










860 








865 










87 0 


Phe 


Leu 


Cys 


Tyr 


Phe 


Phe 


Tyr 


Lys 


Asn 


Phe 


Ala 


Phe 


Thr 


Met 


Val 








875 










880 










885 


His 


Phe 


Trp 


Phe 


Gly 


Phe 


Phe 


Cys 


Gly 


Phe 


Ser 


Ala 


Gin 


Thr 


Val 








890 










895 










900 


Tyr 


Asp 


Gin 


Tyr 


Phe 


He 


Thr 


Leu 


Tyr 


Asn 


He 


Val 


Tyr 


Thr 


Ser 






905 










910 










915 


Leu 


Pro 


Val 


Leu 


Ala 


Met 


Gly Val 


Phe 


Asp 


Gin 


Asp 


Val 


Pro 


Glu 










920 










925 










930 


Gin 


Arg 


Ser 


Met 


Glu 


Tyr 


Pro 


Lys 


Leu 


Tyr 


Glu 


Pro 


Gly Gin 


Leu 








935 










940 










945 


Asn 


Leu 


Leu 


Phe 


Asn 


Lys 


Arg 


Glu 


Phe 


Phe 


He 


Cys 


He 


Ala 


Gin 










950 








955 










960 


Gly 


He 


Tyr 


Thr 


Ser 


Val 


Leu 


Met 


Phe 


Phe 


He 


Pro 


Tyr 


Gly 


Val 






965 










970 










975 


Phe 


Ala 


Asp 


Ala 


Thr 


Arg 


Asp 


Asp 


Gly 


Thr 


Gin 


Leu 


Ala 


Asp 


Tyr 










980 










985 










990 


Gin 


Ser 


Phe 


Ala 


Val 


Thr 


Val 


Ala 


Thr 


Ser 


Leu 


Val 


He 


Val 


Val 










995 










1000 










1005 


Ser 


Val 


Gin 


He 


Gly 


Leu 


Asp 


Thr 


Gly 


Tyr 


Trp 


Thr 


Ala 


He 


Asn 










1010 










1015 










1020 


His 


Phe 


Phe 


He 


Trp 


Gly 


Ser 


Leu 


Ala 


Val 


Tyr 


Phe 


Ala 


He 


Leu 










1025 








1030 










1035 


Phe 


Ala 


Met 


His 


Ser 


Asn 


Gly 


Leu 


Phe 


Asp 


Met 


Phe 


Pro 


Asn 


Gin 










1040 








1045 










1050 


Phe 


Arg 


Phe 


Val 


Gly 


Asn 


Ala 


Gin 


Asn 


Thr 


Leu 


Ala 


Gin 


Pro 


Thr 








1055 










1060 










1065 


Val 


Trp 


Leu 


Thr 


He 


Val 


Leu 


Thr 


Thr 


Val 


Val 


Cys 


He 


Met 


Pro 



6/87 



0204520A2J_> 



WO 02/04520 



PCT/US01/21448 









1070 


Val 


Val 


Ala 


Phe Arg 








1085 


Ser 


Asp 


Thr 


Val Arg 








1100 


Ala 


Gin 


His 


Arg Cys 








1115 


Arg 


Ser 


Gly 


Tyr Ala 








1130 


He 


Met 


Ser 


Gly Lys 








1145 


Ser 


Phe 


Thr 


Thr Arg 








1160 


Arg 


Lys 


Lys 


Ser Asp 








1175 


Lys 


Pro 


Leu 


Lys Gly 








1190 
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Met 


Val 


Leu 


Ala 


Phe 


1 








5 


He 


Leu 


Lys 


Pro 


Asn 
20 


His 


Gin 


Arg 


Cys 


Ser 

-J Zj 


Thr 


Arg 


Met 


Lys 


Lys 
50 


Tyr 


Glu 


Gin 


Leu 


Leu 
65 


Pro 


Gly 


Phe 


Gly 


Gly 
80 


Val 


Glu 


Ser 


He 


Asp 
95 


Met 


Thr 


Phe 


Tyr 


Leu 
110 


Phe 


Pro 


Ser 


Thr 


Ala 
125 


Thr 


Arg 


Lys 


He 


Trp 
140 


Arg 


Ser 


Phe 


He 


His 
155 


Val 


His 


Pro 


Asp 


Gly 
170 


Ser 


Ala 


Met 


Cys 


Phe 
185 


Gin 


Asn 


Cys 


Ser 


Leu 
200 


Asp 


Leu 


Met 


Leu 


Tyr 
215 


Glu 


Glu 


His 


Met 


Ser 
230 


Ala 


Ser 


Ser 


Gly 


Leu 
245 


Arg 


Leu 


Phe 


He 


He 
260 


Val 


Leu 


Pro 


Thr 


Tyr 
275 


Trp 


Val 


Ser 


Phe 


Trp 









1075 


Phe 


Leu 


Arg 


Leu Asn 








1090 


Tyr 


Thr 


Gin 


Leu Val 








1105 


Met 


Arg 


Arg 


Val Gly 








1120 


Phe 


Ser 


His 


Gin Glu 








1135 


Asn 


Met 


Arg 


Leu Ser 








1150 


Ser 


Ser 


Ser 


Ser Trp 








1165 


Ser 


Ala 


Ser 


Ser Pro 








1180 
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Gin 


Leu 


Val 


Ser 


Phe 
10 


Val 


Cys 


Ala 


Ala 


Ser 
25 


Ser 


Ser 


Met 


Lys 


Gin 

4 U 


Asp 


Asp 


Ser 


Thr 


Lys 
55 


His 


lie 


Glu 


Asp 


Asn 
70 


Ser 


Pro 


Val 


Pro 


Val 
85 


Ser 


He 


Ser 


Glu 


Thr 
100 


Arg 


His 


Tyr 


Trp 


Lys 
115 


Asn 


Lys 


Ser 


Met 


Thr 
130 


Val 


Pro 


Asp 


He 


Phe 








145 


Asp 


Thr 


Thr 


Met 


Glu 

160 


Asn 


Val 


Leu 


Leu 


Ser 
175 


Met 


Asp 


Phe 


Ser 


Arg 
190 


Glu 


Leu 


Glu 


Ser 


Tyr 
205 


Trp 


Lys 


His 


Gly 


Asn 
220 


Leu 


Ser 


Gin 


Phe 


Phe 

235 


Ala 


Phe 


Tyr 


Ser 


Ser 
250 


Ser 


Val 


Leu 


Arg 


Arg 
265 


Tyr 


Pro 


Ala 


He 


Leu 
280 


He 


Asp 


Arg 


Arg 


Ala 
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1080 



Leu 


Lys 


Pro 


Asp 


Leu 








1095 


Arg 


Lys 


Lys 


Gin 


Lys 








1110 


Arg 


Thr 


Gly 


Ser 


Arg 








1125 


Gly 


Phe 


Gly 


Glu 


Leu 








1140 


Ser 


Leu 


Ala 


Leu 


Ser 








1155 


He 


Glu 


Ser 


Leu 


Arg 








1170 


Ser 


Gly 


Gly 


Ala 


Asp 








1185 


Thr 


Tyr 


He 




He 










15 


Asn 


He 


Lys 


Met 


Thr 










30 


Thr 


Cys 


Lys 


Gin 


Glu 










45 


Ala 


Arg 


Pro 


Gin 


Lys 










60 


Asp 


Phe 


Ala 


Met 


Arg 










75 


Gly 


He 


Asp 


Val 


His 










90 


Asn 


Met 


Asp 


Phe 


Thr 










105 


Asp 


Glu 


Arg 


Leu 


Ser 










120 


Phe 


Asp 


His 


Arg 


Leu 










135 


Phe 


Val 


His 


Ser 


Lys 










150 


Asn 


He 


Met 


Leu 


Arg 










165 


Leu 


Arg 


He 


Thr 


Val 




Pro 






180 


Phe 


Leu 


Asp 


Thr 










195 


Ala 


Tyr 


Asn 


Glu 


Asp 










210 


Lys 


Ser 


Leu 


Asn 


Thr 










225 


He 


Glu 


Asp 


Phe 


Ser 










240 


Thr 


Gly 


Trp 


Tyr 


Asn 










255 


His 


Val 


Phe 


Phe 


Phe 










270 


Met 


Val 


Met 


Leu 


Ser 










285 


Val 


Pro 


Ala 


Arg 


Val 
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290 


ser 


Leu 


Gly 


He 


Thr 
305 


Ala 


Val 


Ser 


Ala 


Ser 
320 


Asp 


Val 


Tyr 


Leu 


Trp 
335 


He 


Glu 


Tyr 


Ala 


Ala 
350 


Lys 


Gin 


Phe 


Lys 


Lys 
365 


Asp 


Ala 


Val 


Gin 


Ala 
380 


Glu 


He 


Asp 


Met 


Asp 
395 


Phe 


Met 


Arg 


Arg 


Lys 
410 


Arg 


He 


Lys 


Arg 


Arg 
425 


He 


Leu 


Glu 


Asn 


Asn 
440 


Phe 


Pro 


He 


Val 


Tyr 
455 


Tyr 


Val 
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Met 


Pro 


Val 


Arg 


Arg 


1 








5 


Gly 


Thr 


He 


He 


Arg 
20 


He 


Ala 


Asn 


Ala 


Arg 
35 


Asp 


Gly 


Phe 


cys 


Glu 
50 


Gin 


Lys 


Pro 


Cys 


Thr 

65 


Arg 


His 


Asp 


He 


Ala 
80 


Glu 


Arg 


Lys 


Val 


Glu 
95 


Phe 


He 


Cys 


Asn 


Thr 
110 


Val 


Ala 


Met 


Met 


Phe 
125 


Glu 


Asn 


Ala 


Ala 


Thr 
140 


Lys 


Thr 


Val 


Asn 


Arg 
155 


Arg 


Val 


Leu 


Thr 


Tyr 
17 0 


Asp 


Val 


Val 


Val 


He 








185 


Ala 


Met 


Lys 


His 


Phe 
200 


Ser 


Glu 


Ala 


Asp 


Asp 
215 


Ser 


Pro 


Leu 


Val 


Asn 



295 

Thr Val Leu Thr Met 
310 

Met Pro Gin Val Ser 
325 

Val Ser Ser Leu Phe 
340 

Val Asn Tyr Leu Thr 
355 

Thr Gly Lys He Ser 
370 

Met Ala Phe Asp Gly 
385 

Gin Thr Ser Leu Ser 
400 

Ser He Cys Ser Pro 
415 

Lys Ser Leu Gly Gly 
430 

His Val He Asp Thr 
445 

He Leu Phe Asn Leu 
460 



7474240CD1 



Gly 


His 


Val 


Ala 


Pro 








10 


Lys 


Phe 


Glu 


Gly 


Gin 
25 


Val 


Gin 


Asn 


Cys 


Ala 
40 


Met 


Thr 


Gly 


Phe 


Ser 








55 


Cys 


Asp 


Phe 


Leu 


His 
70 


Gin 


He 


Ala 


Gin 


Ala 
85 


Val 


Thr 


Tyr 


Tyr 


His 
100 


His 


He 


He 


Pro 


Val 
115 


He 


He 


Asn 


Phe 


Glu 
130 


Pro 


Glu 


Arg 


Val 


Asn 
145 


Lys 


Phe 


Phe 


Gly 


Phe 
160 


Arg 


Lys 


Gin 


Ser 


Leu 
175 


Asp 


Ser 


Ser 


Lys 


His 
190 


Lys 


Ser 


Pro 


Thr 


Lys 
205 


Thr 


Lys 


Ala 


Leu 


He 
220 


He 


Ser 


Gly 


Pro 


Leu 
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300 



Ser 


Thr 


He 


He 


Thr 










315 


Tyr 


Leu 


Lys 


Ala 


Val 










330 


Val 


Phe 


Leu 


Ser 


Val 










345 


Thr 


Val 


Glu 


Glu 


Arg 










360 


Arg 


Met 


Tyr 


Asn 


He 










375 


Cys 


Tyr 


His 


Asp 


Ser 










390 


Leu 


Asn 


Ser 


Glu 


Asp 










405 


Ser 


Thr 


Asp 


Ser 


Ser 










420 


His 


Val 


Gly 


Arg 


He 










435 


Tyr 


Ser 


Arg 


He 


Leu 










450 


Phe 


Tyr 


Trp 


Gly 


Val 












Gin 


Asn 


Thr 


Phe 


Leu 










15 


Asn 


Lys 


Lys 


Phe 


He 










30 


He 


He 


Tyr 


Cys 


Asn 










45 


Arg 


Pro 


Asp 


Val 


Met 










60 


Gly 


Pro 


Glu 


Thr 


Lys 










75 


Leu 


Leu 


Gly 


Ser 


Glu 








90 


Lys 


Asn 


Gly 


Ser 


Thr 










105 


Lys 


Asn 


Gin 


Glu 


Gly 










120 


Tyr 


Val 


Thr 


Asp 


Asn 










135 


Pro 


He 


Leu 


Pro 


He 










150 


Lys 


Phe 


Pro 


Gly 


Leu 










165 


Pro 


Gin 


Glu 


Asp 


Pro 










180 


Ser 


Asp 


Asp 


Ser 


Val 










195 


Glu 


Ser 


Cys 


Ser 


Pro 










210 


Gin 


Pro 


Ser 


Lys 


Cys 










225 


Asp 


His 


Ser 


Ser 


Pro 
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230 235 240 



Lys 


Arg 


Gin 


Trp 


Asp 
245 


Arg 


Leu 


Tyr 


Pro 


Asp 
250 


Met 


Leu 


Gin 


Ser 


Ser 
255 


Ser 


Gin 


Leu 


Ser 


His 
260 


Ser 


Arg 


Ser 


Arg 


Glu 
265 


Ser 


Leu 


Cys 


Ser 


He 
270 


Arg 


Arg 


Ala 


Ser 


Ser 
275 


Val 


His 


Asp 


He 


Glu 
280 


Gly 


Phe 


Gly 


Val 


His 
285 


Pro 


Lys 


Asn 


He 


Phe 
290 


Arg 


Asp 


Arg 


His 


Ala 
295 


Ser 


Glu 


Asp 


Asn 


Gly 
300 


Arg 


Asn 


Val 


Lys 


Gly 
305 


Pro 


Phe 


Asn 


His 


He 
310 


Lys 


Ser 


Ser 


Leu 


Leu 
315 


Gly 


Ser 


Thr 


Ser 


Asp 
320 


Ser 


Asn 


Leu 


Asn 


Lys 
325 


Tyr 


Ser 


Thr 


He 


Asn 
330 


Lys 


He 


Pro 


Gin 


Leu 


Thr 


Leu 


Asn 


Phe 


Ser 


Glu 


Val 


Lys 


Thr 


Glu 










335 










340 








345 


Lys 


Lys 


Asn 


Ser 


Ser 
350 


Pro 


Pro 


Ser 


Ser 


Asp 
355 


Lys 


Thr 


He 


He 


Ala 
360 


Pro 


Lys 


Val 


Lys 


Asp 
365 


Arg 


Thr 


His 


Asn 


Val 
370 


Thr 


Glu 


Lys 


Val 


Thr 
375 


Gin 


Val 


Leu 


Ser 


Leu 
380 


Gly 


Ala 


Asp 


Val 


Leu 
385 


Pro 


Glu 


Tyr 


Lys 


Leu 
390 


Gin 


Thr 


Pro 


Arg 


He 
395 


Asn 


Lys 


Phe 


Thr 


He 
400 


Leu 


His 


Tyr 


Ser 


Pro 
405 


Phe 


Lys 


Ala 


Val 


Trp 
410 


Asp 


Trp 


Leu 


He 


Leu 
415 


Leu 


Leu 


Val 


He 


Tyr 
420 


Thr 


Ala 


He 


Phe 


Thr 
425 


Pro 


Tyr 


Ser 


Ala 


Ala 
430 


Phe 


Leu 


Leu 


Asn 


Asp 
435 


Arg 


Glu 


Glu 


Gin 


Lys 
440 


Arg 


Arg 


Glu 


Cys 


Gly 
445 


Tyr 


Ser 


Cys 


Ser 


Pro 
450 


Leu 


Asn 


Val 


Val 


Asp 
455 


Leu 


He 


Val 


Asp 


He 
460 


Met 


Phe 


He 


He 


Asp 
465 


He 


Leu 


He 


Asn 


Phe 
470 


Arg 


Thr 


Thr 


Tyr 


Val 
475 


Asn 


Gin 


Asn 


Glu 


GlU 
480 


Val 


Val 


Ser 


Asp 


Pro 
485 


Ala 


Lys 


He 


Ala 


He 
490 


His 


Tyr 


Phe 


Lys 


Gly 
495 


Trp 


Phe 


Leu 


He 


Asp 
500 


Met 


Val 


Ala 


Ala 


He 
505 


Pro 


Phe 


Asp 


Leu 


Leu 
510 


He 


Phe 


Gly 


Ser 


Gly 
515 


Ser 


Asp 


Glu 


Thr 


Thr 
520 


Thr 


Leu 


He 


Gly 


Leu 
525 


Leu 


Lys 


Thr 


Ala 


Arg 
530 


Leu 


Leu 


Arg 


Leu 


Val 
535 


Arg 


Val 


Ala 


Arg 


Lys 

540 


Leu 


Asp 


Arg 


Tyr 


Ser 
545 


Glu 


Tyr 


Gly 


Ala 


Ala 
550 


Val 


Leu 


Met 


Leu 


Leu 
555 


Met 


Cys 


He 


Phe 


Ala 
560 


Leu 


He 


Ala 


His 


Trp 
565 


Leu 


Ala 


Cys 


He 


Trp 
570 


Tyr 


Ala 


He 


Gly 


Asn 
575 


Val 


Glu 


Arg 


Pro 


Tyr 
580 


Leu 


Thr 


Asp 


Lys 


He 
585 


Gly 


Trp 


Leu 


Asp 


Ser 
590 


Leu 


Gly 


Gin 


Gin 


He 
595 


Gly 


Lys 


Arg 


Tyr 


Asn 
600 


Asp 


Ser 


Asp 


Ser 


Ser 
605 


Ser 


Gly 


Pro 


Ser 


He 
610 


Lys 


Asp 


Lys 


Tyr 


Val 
615 


Thr 


Ala 


Leu 


Tyr 


Phe 


Thr 


Phe 


Ser 


Ser 


Leu 


Thr 


Ser 


Val 


Gly 


Phe 










620 










625 








630 


Gly 


Asn 


Val 


Ser 


Pro 
635 


Asn 


Thr 


Asn 


Ser 


Glu 
640 


Lys 


He 


Phe 


Ser 


He 
645 


Cys 


Val 


Met 


Leu 


He 
650 


Gly 


Ser 


Leu 


Met 


Tyr 
655 


Ala 


Ser 


He 


Phe 


Gly 
660 


Asn 


Val 


Ser 


Ala 


He 
665 


He 


Gin 


Arg 


Leu 


Tyr 
670 


Ser 


Gly 


Thr 


Ala 


Arg 
675 


Tyr 


His 


Met 


Gin 


Met 
680 


Leu 


Arg 


Val 


Lys 


Glu 
685 


Phe 


He 


Arg 


Phe 


His 
690 


Gin 


He 


Pro 


Asn 


Pro 
695 


Leu 


Arg 


Gin 


Arg 


Leu 
700 


Glu 


Glu 


Tyr 


Phe 


Gin 
705 


His 


Ala 


Trp 


Thr 


Tyr 
710 


Thr 


Asn 


Gly 


He 


Asp 
715 


Met 


Asn 


Met 


Val 


Leu 
720 


Lys 


Gly 


Phe 


Pro 


Glu 




Leu 


Gin 


Ala 


Asp 


He 


Cys 


Leu 


His 


Leu 



725 730 735 
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Asn 


Gin 


Thr 


Leu 


Leu 


Gin 


Asn 


Cys 


Lys 


Ala 


Phe 


Arg 


Gly 


Ala Ser 










740 










745 








750 


Lys 


Gly 


Cys 


Leu 


Arg 


Ala 


Leu 


Ala 


Met 


Lys 


Phe 


Lys 


Thr 


Thr His 








755 










760 








765 


Ala 


Pro 


Pro 


Gly 


Asp 


Thr 


Leu 


Val 


His 


Cys 


Gly 


Asp 


Val 


Leu Thr 










770 










775 








780 


Ala 


Leu 


Tyr 


Phe 


Leu 


Ser Arg Gly 


Ser 


He 


Glu 


He 


Leu 


Lys Asp 








785 










790 








795 


Asp 


lie 


Val 


Val 


Ala 


He 


Leu Gly 


Lys 


Asn 


Asp 


He 


Phe 


Gly Glu 










800 










805 








810 


Met 


Val 


His 


Leu 


Tyr 


Ala 


Lys 


Pro 


Gly 


Lys 


Ser 


Asn 


Ala 


Asp Val 










815 










820 








825 


Arg 


Ala 


Leu 


Thr 


Tyr 


Cys 


Asp 


Leu 


His 


Lys 


He 


Gin 


Arg 


Glu Asp 










830 










835 








840 


Leu 


Leu 


Glu 


Val 


Leu 


Asp 


Met 


Tyr 


Pro 


Glu 


Phe 


Ser 


Asp 


His Phe 










845 










850 








855 


Leu 


Thr 


Asn 


Leu 


Glu 


Leu 


Thr 


Phe 


Asn 


Leu 


Arg 


His 


Glu 


Ser Ala 










860 










865 








870 


Lys 


Ala 


Asp 


Leu 


Leu 


Arg 


Ser 


Gin 


Ser 


Met 


Asn 


Asp 


Ser 


Glu Gly 








875 










880 








885 


Asp 


Asn 


Cys 


Lys 


Leu 


Arg 


Arg 


Arg 


Lys 


Leu 


Ser 


Phe 


Glu 


Ser Glu 










890 










895 








900 


Gly 


Glu 


Lys 


Glu 


Asn 


Ser 


Thr 


Asn 


Asp 


Pro 


Glu 


Asp 


Ser 


Ala Asp 










905 










910 








915 


Thr 


He 


Arg 


His 


Tyr 


Gin 


Ser 


Ser 


Lys 


Arg 


His 


Phe 


Glu 


Glu Lys 










920 










925 








930 


Lys 


Ser 


Arg 


Ser 


Ser 


Ser 


Phe 


He 


Ser 


Ser 


He 


Asp 


Asp 


Glu Gin 










935 










940 








945 


Lys 


Pro 


Leu 


Phe 


Ser 


Gly 


He 


Val 


Asp 


Ser 


Ser 


Pro 


Gly 


He Gly 










950 










955 








960 


Lys 


Ala 


Ser 


Gly 


Leu 


Asp 


Phe 


Glu 


Glu 


Thr 


Val 


Pro 


Thr 


Ser Gly 








965 








970 








975 


Arg 


Met 


His 


He 


Asp 


Lys 


Arg 


Ser 


His 


Ser 


Cys 


Lys 


Asp 


He Thr 










980 










985 








990 


Asp 


Met 


Arg 


Ser 


Trp 


Glu 


Arg 


Glu 


Asn 


Ala 


His 


Pro 


Gin 


Pro Glu 










995 










1000 








1005 


Asp 


Ser 


Ser 


Pro 


Ser 


Ala 


Leu 


Gin 


Arg 


Ala 


Ala 


Trp 


Gly 


He Ser 










1010 










1015 








1020 


Glu 


Thr 


Glu 


Ser 


Asp 


Leu 


Thr 


Tyr 


Gly 


Glu 


Val 


Glu 


Gin 


Arg Leu 










1025 










1030 








1035 


Asp 


Leu 


Leu 


Gin 


Glu 


Gin 


Leu 


Asn 


Arg 


Leu 


Glu 


Ser 


Gin 


Met Thr 










1040 










1045 








1050 


Thr 


Asp 


He 


Gin 


Thr 


He 


Leu 


Gin 


Leu 


Leu 


Gin 


Lys 


Gin 


Thr Thr 










1055 










1060 








1065 


Val 


Val 


Pro 


Pro 


Ala 


Tyr 


Ser 


Met 


Val 


Thr 


Ala 


Gly 


Ser 


Glu Tyr 










1070 










1075 








1080 


Gin 


Arg 


Pro 


He 


He 


Gin 


Leu 


Met 


Arg 


Thr 


Ser 


Gin 


Pro 


Glu Ala 










1085 










1090 








1095 


Ser 


He 


Lys 


Thr 


Asp 


Arg 


Ser 


Phe 


Ser 


Pro 


Ser 


Ser 


Gin 


Cys Pro 










1100 










1105 








1110 


Glu 


Phe 


Leu 


Asp 


Leu 


Glu 


Lys 


Ser 


Lys 


Leu 


Lys 


Ser 


Lys 


Glu Ser 










1115 










1120 








1125 


Leu 


Ser 


Ser 


Gly Val 


His 


Leu 


Asn 


Thr 


Ala 


Ser 


Glu 


Asp 


Asn Leu 










1130 










1135 








1140 


Thr 


Ser 


Leu 


Leu 


Lys 


Gin 


Asp 


Ser 


Asp 


Leu 


Ser 


Leu 


Glu 


Leu His 










1145 








1150 








1155 


Leu 


Arg 


Gin 


Arg 


Lys 


Thr 


Tyr 


Val 


His 


Pro 


He 


Arg 


His 


Pro Ser 










1160 










1165 








1170 


Leu 


Pro 


Asp 


Ser 


Ser 


Leu 


Ser 


Thr 


Val 


Gly 


He 


Val 


Gly 


Leu His 










1175 










1180 








1185 


Arg 


His 


Val 


Ser 


Asp 


Pro 


Gly 


Leu 


Pro 


Gly 


Lys 









1190 1195 
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Met 


Glu 


Asn 


Lys 


Glu 


Ala Gly 


Thr 


Pro 


1 








5 










Glu 


Gly 


Arg 


Leu 


Gin 


Pro 


Thr 


Leu 


Leu 










20 










Ala 


Phe 


Gly 


Ser 


Ala 


Phe 


Gin 


Tyr 


Gly 










35 










Asn 


Thr 


Pro 


His 


Lys 


Val 


Phe 


Lys 


Ser 










50 










Phe 


Glu 


Arg 


His 


Ala 


Thr 


Phe 


Met 


Asp 










65 










Leu 


Trp 


Ser 


Cys 


Thr 


Val 


Ser 


Met 


Phe 










80 










Gly 


Ser 


Leu 


Leu 


Val 


Gly Leu 


Leu 


Val 










95 










Gly 


Thr 


Leu 


Leu 


He 


Asn 


Asn 


He 


Phe 










110 










Leu 


Met 


Gly 


Val 


Ser 


Lys 


Val 


Ala 


Lys 










125 










Phe 


Ser 


Arg 


Val 


Val 


Leu Gly 


Val 


Cys 










140 










Ala 


Leu 


Pro 


Met 


Tyr 


Leu Gly 


Glu 


Leu 










155 










Gly 


Met 


Val 


Gly 


Thr 


Met 


Thr 


Glu 


Val 








170 










Phe 


Leu 


Ala 


Gin 


He 


Phe 


Ser 


Leu 


Gin 










185 










Ala 


Gly 


Trp 


Pro 


Val 


Leu 


Leu 


Ala 


Leu 










200 










Leu 


Gin 


Leu 


Leu 


Thr 


Leu 


Pro 


Phe 


Phe 










215 










Ser 


Leu 


lie 


Gin 


Lys 


Gly Asp 


Glu 


Ala 










230 










Arg 


Arg 


Leu 


Arg 


Gly 


His 


Thr 


Asp 


Met 










245 










Met 


Arg 


Ala 


Glu 


Ala 


Arg 


Ala 


GlU 


Arg 










260 










Val 


Leu 


His 


Leu 


Cys 


Ala 


Leu 


Arg 


Ser 










275 










Ser 


He 


He 


Val 


Leu 


Met 


Ala 


Gly Gin 










290 










Ala 


He 


Asn 


Tyr 


Tyr 


Ala 


Asp 


Thr 


He 










305 










Glu 


Ala 


Ala 


His 


Ser 


Gin 


Tyr 


Val 


Thr 










320 










Asn 


He 


Val 


Met 


Thr 


He 


Thr 


Ser 


Ala 










335 










Gly 


Arg 


Arg 


His 


Leu 


Leu 


Leu 


Ala 


Gly 










350 










Ala 


Cys 


Leu 


Val 


Leu 


Thr 


Val 


Val 


Leu 










365 










Pro 


Glu 


Leu 


Ser 


Tyr 


Leu 


Gly 


He 


He 










380 










Ala 


Gly 


His 


Ser 


He 


Gly 


Pro 


Ser 


Pro 










395 










Thr 


Glu 


He 


Phe 


Leu 


Gin 


Ser 


Ser 


Arg 










410 










Asp 


Gly 


Ala 


Val 


His 


Trp 


Leu 


Thr 


Asn 










425 










Phe 


Pro 


Ser 


He 


Gin 


Glu 


Ala 


He Gly 










440 










Phe 


Ala 


Gly 


He 


Cys 


Leu 


Leu 


Thr 


Ala 



455 
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Pro 


Pro 


He 


Pro 


Ser 


Arg 


10 










15 


Leu 


Ala 


Thr 


Leu 


Ser 


Ala 


25 










30 


Tyr 


Asn 


Leu 


Ser 


Val 


Val 


40 










45 


Phe 


Tyr 


Asn 


Glu 


Thr 


Tyr 


55 










60 


Gly 


Lys 


Leu 


Met 


Leu 


Leu 


70 










75 


Pro 


Leu Gly Gly Leu 


Leu 


85 










90 


Asp 


Ser 


Cys 


Gly Arg 


Lys 


100 










105 


Ala 


He 


He 


Pro 


Ala 


He 


115 










120 


Ala 


Phe 


Glu 


Leu 


He 


Val 


130 










135 


Ala 


Gly 


He 


Ser 


Tyr 


Ser 


145 










150 


Ala 


Pro 


Lys 


Asn 


Leu 


Arg 


160 










165 


Phe 


Val 


He 


Val 


Gly 


Val 


175 










180 


Ala 


He 


Leu 


Gly Asn 


Pro 


190 










195 


Thr 


Gly 


Val 


Pro 


Ala 


Leu 


205 










210 


Pro 


Glu 


Ser 


Pro 


Arg 


Tyr 


220 










225 


Thr 


Ala 


Arg 


Gin 


Ala 


Leu 


235 










240 


Glu 


Ala 


Glu 


Leu 


GlU 


Asp 


250 










255 


Ala 


Glu 


Gly 


His 


Leu 


Ser 


265 










270 


Leu 


Arg 


Trp 


Gin 


Leu 


Leu 


280 










285 


Gin 


Leu 


Ser Gly 


lie 


Asn 


295 










300 


Tyr 


Thr 


Ser 


Ala 


Gly 


Val 


310 










315 


Val 


Gly 


Ser Gly Val 


Val 


325 










330 


Val 


Leu 


Val 


Glu 


Arg 


Leu 


340 










345 


Tyr 


Gly 


He 


Cys 


Gly 


Ser 


355 










360 


Leu 


Phe 


Gin 


Asn 


Arg 


Val 


370 










375 


Cys 


Val 


Phe 


Ala 


Tyr 


He 


385 










390 


Val 


Pro 


Ser 


Val 


Val 


Arg 


400 










405 


Arg 


Ala 


Ala 


Phe 


Met 


Val 


415 










420 


Phe 


He 


He Gly Phe 


Leu 


430 










435 


Ala 


Tyr 


Ser 


Phe 


He 


He 


445 










450 


He 


Tyr 


He 


Tyr 


Val 


Val 


460 










465 
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lie Pro Glu Thr Lys Gly Lys Thr Phe Val Glu lie Asn Arg lie 
470 475 480 

Phe Ala Lys Arg Asn Arg Val Lys Leu Pro Glu Glu Lys Glu Glu 
485 490 495 

Thr lie Asp Ala Gly Pro Pro Thr Ala Ser Pro Ala Lys Glu Thr 
500 505 510 

Ser Phe 
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Met 


Thr 


Ala 


Ser 


Thr 


Pro 


Glu 


Ala 


Thr 


Pro 


Asn 


Met 


Glu 


Leu 


Lys 


1 








5 










10 










15 


Ala 


Pro 


Ala 


Ala 


Gly 


Gly Leu 


Asn 


Ala 


Gly 


Pro 


Val 


Pro 


Pro 


Ala 










20 










25 










30 


Ala 


Met 


Ser 


Thr 


Gin 
35 


Arg 


Leu 


Arg 


Asn 


Glu 
40 


Asp 


Tyr 


His 


Asp 


Tyr 
45 


Ser 


Ser 


Thr 


Asp 


Val 


Ser 


Pro 


Glu 


Glu 


Ser 


Pro 


Ser 


Glu Gly 


Leu 








50 










55 










60 


Asn 


Asn 


Leu 


Ser 


Ser 


Pro 


Gly 


Ser 


Tyr 


Gin 


Arg 


Phe 


Gly Gin 


Ser 










65 










70 










75 


Asn 


Ser 


Thr 


Thr 


Trp 
80 


Phe 


Gin 


Thr 


Leu 


He 
85 


His 


Leu 


Leu 


Lys 


Gly 
90 


Asn 


He 


Gly 


Thr 


Gly 
95 


Leu 


Leu 


Gly 


Leu 


Pro 
100 


Leu 


Ala 


Val 


Lys 


Asn 
105 


Ala 


Gly 


He 


Val 


Met 


Gly 


Pro 


He 


Ser 


Leu 


Leu 


He 


He 


Gly 


He 








110 








115 










120 


Val 


Ala 


Val 


His 


Cys 
125 


Met 


Gly 


He 


Leu 


Val 
130 


Lys 


Cys 


Ala 


His 


His 
135 


Phe 


Cys 


Arg 


Arg 


Leu 
140 


Asn 


Lys 


Ser 


Phe 


Val 
145 


Asp 


Tyr 


Gly 


Asp 


Thr 
150 


Val 


Met 


Tyr 


Gly 


Leu 


Glu 


Ser 


Ser 


Pro 


Cys 


Ser 


Trp 


Leu 


Arg 


Asn 








155 










160 










165 


His 


Ala 


His 


Trp 


Gly 
170 


Arg 


Arg 


Val 


Val 


Asp 
175 


Phe 


Phe 


Leu 


He 


Val 
180 


Thr 


Gin Leu Gly 


Phe 


Cys 


Cys 


Val 


Tyr 


Phe 


Val 


Phe 


Leu 


Ala 


Asp 










185 










190 










195 


Asn 


Phe 


Lys 


Gin 


Val 


He 


Glu 


Ala 


Ala 


Asn 


Gly 


Thr 


Thr 


Asn 


Asn 








200 










205 










210 


Cys 


His 


Asn 


Asn 


Glu 


Thr 


Val 


He 


Leu 


Thr 


Pro 


Thr 


Met 


Asp 


Ser 








215 










220 










225 


Arg 


Leu 


Tyr 


Met 


Leu 


Ser 


Phe 


Leu 


Pro 


Phe 


Leu 


Val 


Leu 


Leu 


Val 






230 










235 










240 


Phe 


He 


Arg Asn 


Leu 


Arg 


Ala 


Leu 


Ser 


He 


Phe 


Ser 


Leu 


Leu 


Ala 










245 








250 










255 


Asn 


He 


Thr 


Met 


Leu 


Val 


Ser 


Leu 


Val 


Met 


He 


Tyr 


Gin 


Phe 


He 










260 










265 








270 


Val 


Gin Arg 


He 


Pro 


Asp 


Pro 


Ser 


His 


Leu 


Pro 


Leu 


Val 


Ala 


Pro 










275 








280 










285 


Trp 


Lys 


Thr 


Tyr 


Pro 


Leu 


Phe 


Phe 


Gly 


Thr 


Ala 


He 


Phe 


Ser 


Phe 








290 










295 










300 


Glu 


Gly 


He Gly 


Met 


Val 


Leu 


Pro 


Leu 


Glu 


Asn 


Lys 


Met 


Lys 


Asp 










305 










310 










315 


Pro 


Arg 


Lys 


Phe 


Pro 


Leu 


He 


Leu 


Tyr 


Leu 


Gly Met Val 


He 


Val 










320 










325 










330 


Thr 


He 


Leu 


Tyr 


He 


Ser 


Leu 


Gly Cys 


Leu 


Gly 


Tyr 


Leu 


Gin 


Phe 










335 










340 










345 


Gly 


Ala 


Asn 


He 


Gin 
350 


Gly 


Ser 


He 


Thr 


Leu 
355 


Asn 


Leu 


Pro 


Asn 


Cys 
360 
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Trp 


Leu 


Tyr 


Gin 


Ser 


Val 


Lys 


Leu 


Leu 


Tyr 


Ser 


He 


Gly 


He 


Phe 










365 










370 










375 


Phe 


Thr 


Tyr 


Ala 


Leu 


Gin 


Phe 


Tyr 


Val 


Pro 


Ala 


Glu 


He 


He 


He 








380 










385 










390 


Pro 


Phe 


Phe 


Val 


Ser 


Arg 


Ala 


Pro 


Glu 


Pro 


Cys 


Glu 


Leu 


Val 


Val 










395 










400 










405 


Asp 


Leu 


Phe 


Val 


Arg 


Pro 


Val 


Leu 


Val 


Cys 


Leu 


Thr 


Ser 


Leu 


Ser 










410 










415 










420 


Gly 


Ser 


Val 


Asp 


Asn 


Gly 


Trp 


Tyr 


Gly 


Thr 


Glu 


Ala Asp 


Gly 


Thr 










425 










430 










435 


Ser 


Cys 


Gly 


Ser 


Ala 


Pro 


Leu 


Val 


Phe 


Val 


Ser 


Ser 


Ser 


Phe 


Leu 








440 










445 










450 


Ala 


His 


Pro 


Trp 


Leu 


Ser 


Phe 


Arg 


Cys 


Glu 


Ser 


Gin 


Trp 


Val 


Ser 










455 








460 










465 


Cys 


His 


Arg 


Asp 


Thr 


Val 


Val 


Val 


Trp 


Gly 


Phe 


Ala 


Arg 


Gly 


He 










470 










475 










480 


Leu 


Ala 


He 


Leu 


He 


Pro 


Arg 


Leu 


Asp 


Leu 


Val 


He 


Ser 


Leu 


Val 










485 










490 










495 


Gly 


Ser 


Val 


Ser 


Ser 


Ser 


Ala 


Leu 


Ala 


Leu 


He 


lie 


Pro 


Pro 


Leu 










500 










505 










510 


Leu 


Glu 


Val 


Thr 


Thr 


Phe 


Tyr 


Ser 


Glu 


Gly 


Met 


Ser 


Pro 


Leu 


Thr 










515 










520 










525 


lie 


rue 


Lys 


Asp 


Ala 


Leu 


ne 


Ser 


lie 


Leu 


oiy 


Phe 


val 




irne 










530 










535 










540 


Val 


Val 


Gly 


Thr 


Tyr 


Glu 


Ala 


Leu 


Tyr 


Glu 


Leu 


He 


Gin 


Pro 


Ser 










545 










550 










555 


Asn 


Ala 


Pro 


He 


Phe 


He 


Asn 


Ser 


Thr 


Cys 


Ala 


Phe 


He 














560 
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Met 


Pro 


Val 


Arg 


Arg 


Gly 


His 


Val 


Ala 


Pro 


Gin 


Asn 


Thr 


Tyr 


Leu 


1 








5 










10 










15 


Asp 


Thr 


He 


He 


Arg 


Lys 


Phe 


Glu 


Gly 


Gin 


Ser 


Arg 


Lys 


Phe 


Leu 










20 










25 










30 


lie 


Ala 


Asn 


Ala 


Gin 


Met 


Glu 


Asn 


Cys 


Ala 


He 


He 


Tyr 


Cys 


Asn 










35 










40 










45 


Asp 


Gly 


Phe 


Cys 


Glu 


Leu 


Phe 


Gly 


Tyr 


Ser 


Arg 


Val 


Glu 


Val 


Met 










50 










55 










60 


Gin 


Gin 


Pro 


Cys 


Thr 


Cys 


Asp 


Phe 


Leu 


Thr 


Gly 


Pro 


Asn 


Thr 


Pro 










65 










70 










75 


Ser 


Ser 


Ala 


Val 


Ser 


Arg 


Leu 


Ala 


Gin 


Ala 


Leu 


Leu 


Gly Ala 


Glu 










80 










85 










90 


Glu 


Cys 


Lys 


Val 


Asp 


He 


Leu 


Tyr 


Tyr 


Arg 


Lys 


Asp 


Ala 


Ser 


Ser 










95 










100 










105 


Phe 


Arg 


Cys 


Leu 


Val 


Asp 


Val 


Val 


Pro 


Val 


Lys 


Asn 


Glu Asp 


Gly 










110 










115 










120 


Ala 


Val 


He 


Met 


Phe 


He 


Leu 


Asn 


Phe 


Glu 


Asp 


Leu 


Ala 


Gin 


Leu 










125 










130 








135 


Leu 


Ala 


Lys 


Cys 


Ser 


Ser 


Arg 


Ser 


Leu 


Ser 


Gin 


Arg 


Leu 


Leu 


Ser 










140 










145 










150 


Gin 


Ser 


Phe 


Leu 


Gly 


Ser 


Glu 


Gly 


Ser 


His 


Gly 


Arg 


Pro Gly 


Gly 










155 










160 










165 


Pro 


Gly Pro 


Gly 


Thr 


Gly 


Arg 


Gly 


Lys 


Tyr 


Arg 


Thr 


He 


Ser 


Gin 










170 










175 










180 


He 


Pro 


Gin 


Phe 


Thr 


Leu 


Asn 


Phe 


Val 


Glu 


Phe 


Asn 


Leu 


Glu 


Lys 










185 










190 










195 


His 


Arg 


Ser 


Ser 


Ser 


Thr 


Thr 


Glu 


He 


Glu 


He 


He 


Ala 


Pro 


His 










200 










205 










210 
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Lys 


Val 


val 


Glu 


Arg 
215 


Thr 


Gin 


Val 


Leu 


Ser 


Leu 


Gly 
230 


Ala 


Asp 


Ala 


Pro 


Arg 


He 


His 
245 


Arg 


Trp 


Lys 


Ala 


Val 


Trp 


Asp 
260 


Trp 


Leu 


Ala 


Val 


Phe 


Thr 


Pro 
275 


Tyr 


Ser 


Asp 


Glu 


Ser 


Arg 


Arg 
290 


Gly 


Ala 


Thr 


Val 


Val 


Asp 


Leu 


He 


Val 








305 






Val 


He 


Asn 


Phe 


Arg 
320 


Thr 


Thr 


Val 


Ser 


His 


Pro 


Arg 
335 


Arg 


He 


Phe 


Leu 


He 


Asp 


Met 
350 


Val 


Ala 


Phe 


Arg 


Thr 


Gly 


Ser 
365 


Asp 


Glu 


Lys 


Thr 


Ala 


Arg 


Leu 
380 


Leu 


Arg 


Asp 


Arg 


Tyr 


Ser 


Glu 
395 


Tyr 


Gly 


Cys 


Thr 


Phe 


Pro 


Leu 


He 


Ala 








410 






Ala 


He 


Gly 


Asn 


Val 
425 


Glu 


Arg 


Trp 


Leu 


Asp 


Ser 


Leu 
440 


Gly 


Val 


Ser 


Asp 


Pro 


Ala 


Ser 


Gly 


Pro 










455 




Ala 


Leu 


Tyr 


Phe 


Thr 
47 0 


Phe 


Ser 


Asn 


Val 


Ser 


Pro 


Asn 
485 


Thr 


Asn 


Val 


Met 


Leu 


He 


Gly 
500 


Ser 


Leu 


Val 


Ser 


Ala 


He 


He 


Gin 


Arg 










515 




His 


Thr 


Gin 


Met 


Leu 
530 


Arg 


Val 


He 


Pro 


Asn 


Pro 


Leu 
545 


Arg 


Gin 


Ala 


Trp 


Ser 


Tyr 


Thr 
560 


Asn 


Gly 


Gly 


Phe 


Pro 


Glu 


Cys 
575 


Leu 


Gin 


Arg 


Ala 


Leu 


Leu 


Gin 
590 


His 


Cys 


Gly 


Cys 


Leu 


Arg 


Ala 
605 


Leu 


Ala 


Pro 


Pro 


Gly 


Asp 


Thr 
620 


Leu 


Val 


Leu 


Tyr 


Phe 


He 


Ser 
635 


Arg 


Gly 


Val 


val 


Val 


Ala 


He 


Leu 


Gly 










650 




Val 


Ser 


Leu 


His 


Ala 
665 


Gin 


Pro 


Ala 


Leu 


Thr 


Tyr 


Cys 
680 


Asp 


Leu 


Leu 


Glu 


Val 


Leu 


Asp 
695 


Met 


Tyr 


Ser 


Lys 


Leu 


Glu 


Val 


Thr 


Phe 



Asn 


Val 


Thr 
220 


Glu 


Lys 


Val 


Thr 


Gin 

225 


Val 


Leu 


Pro 

235 


Glu 


Tyr 


Lys 


Leu 


Gin 
240 


Thr 


He 


Leu 


His 


Tyr 


Ser 


Pro 


Phe 






250 








255 


He 


Leu 


Leu 
265 


Leu 


Val 


He 


Tyr 


Thr 
270 


Ala 


Ala 


Phe 
280 


Leu 


Leu 


Ser 


Asp 


Gin 
285 


Cys 


Ser 


Tyr 
295 


Thr 


Cys 


Ser 


Pro 


Leu 
300 


Asp 


He 


Met 


Phe 


Val 


Val 


Asp 


He 




310 










315 


Tyr 


Val 


Asn 
325 


Thr 


Asn 


Asp 


Glu 


Val 
330 


Ala 


Val 


His 
340 


Tyr 


Phe 


Lys 


Gly 


Trp 
345 


Ala 


He 


Pro 
355 


Phe 


Asp 


Leu 


Leu 


He 
360 


Thr 


Thr 


Thr 


Leu 


He 


Gly 


Leu 


Leu 






370 








375 


Leu 


Val 


Arg 
385 


Val 


Ala 


Arg 


Lys 


Leu 
390 


Ala 


Ala 


Val 
400 


Leu 


Phe 


Leu 


Leu 


Met 
405 


His 


Trp 


Leu 
415 


Ala 


Cys 


He 


Trp 


Tyr 
420 


Pro 


Tyr 


Leu 
430 


Glu 


His 


Lys 


He 


Gly 
435 


Gin 


Leu 


Gly 
445 


Lys 


Arg 


Tyr 


Asn 


Gly 
450 


Ser 


Val 


Gin 
460 


Asp 


Lys 


Tyr 


Val 


Thr 
465 


Ser 


Leu 


Thr 


Ser 


Val 


Gly 


Phe 


Gly 






475 








480 


Ser 


Glu 


Lys 
490 


Val 


Phe 


Ser 


He 


Cys 
495 


Met 


Tyr 


Ala 


Ser 


He 


Phe 


Gly 


Asn 




505 










510 


Leu 


Tyr 


Ser 
520 


Gly 


Thr 


Ala 


Arg 


Tyr 
525 


Lys 


Glu 


Phe 

535 


He 


Arg 


Phe 


His 


Gin 
540 


Arg 


Leu 


Glu 
550 


Glu 


Tyr 


Phe 


Gin 


His 
555 


He 


Asp 


Met 
565 


Asn 


Ala 


Val 


Leu 


Lys 
570 


Ala 


Asp 


He 
580 


Cys 


Leu 


His 


Leu 


His 
585 


Pro 


Ala 


Phe 
595 


Ser 


Gly 


Ala 


Gly 


Lys 
600 


Val 


Lys 


Phe 
610 


Lys 


Thr 


Thr 


His 


Ala 
615 


His 


Leu 


Gly 
625 


Asp 


Val 


Leu 


Ser 


Thr 
630 


Ser 


He 


Glu 
640 


He 


Leu 


Arg 


Asp 


Asp 
645 


Lys 


Asn 


Asp 
655 


He 


Phe 


Gly 


Glu 


Pro 
660 


Gly 


Lys 


Ser 
670 


Ser 


Ala 


Asp 


Val 


Arg 
675 


His 


Lys 


He 
685 


Gin 


Arg 


Ala 


Asp 


Leu 
690 


Pro 


Ala 


Phe 
700 


Ala 


Glu 


Ser 


Phe 


Trp 
705 


Asn 


Leu 


Arg 


Asp 


Val 


Thr 


Gly 


Gly 
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710 


Leu 


His 


Ser 


Ser 


Pro 
725. 


Gly 


Phe 


Phe 


Leu 


Ser 
740 


Ser 


lie 


Ser 


Asp 


Ala 
755 


Met 


Pro 


Pro 


Lys 


His 
770 


Cys 


Trp 


Pro 


Leu 


Lys 
785 


Gin 


Met 


Asn 


Arg 


Leu 
800 


lie 


Leu 


Gin 


Leu 


Leu 
815 


Tyr 


He 


Leu 


Glu 


Ala 
830 


lie 


Ala 


Ser 


Glu 


Thr 
845 


Phe 


Leu 


Pro 


Pro 


Ala 
860 


Cys 


Ser 


Pro 


Lys 


His 
875 


Ala 


Val 


Ala 


Met 


Asp 
890 


Gin 


Pro 


Glu 


Gly 


Leu 
905 


Leu 


Glu 


Val 


Gin 


Gly 
920 


Pro 


Glu 


His 


Leu 


Gly 
935 


His 


Gly 


Ser 


Asp 


Pro 
950 
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Met 


Gly 


His 


Gin 


Gly 


1 








5 


Val 


He 


Ala 


Thr 


Trp 
20 


Leu 


Leu 


Gly 


Ala 


Leu 

35 


Arg 


Gly 


Arg 


Gin 


Thr 
50 


Pro 


Phe 


Ser 


Leu 


Leu 
65 


Pro 


Gin 


Ser 


Glu 


Gly 
80 


Ala 


Cys 


Leu 


Pro 


Pro 
95 


His 


Glu 


Glu 


Leu 


Ser 
110 


Thr 


Leu 


Thr 


Leu 


Asp 
125 


Pro 


Gly 


Val 


Leu 


Pro 
140 


Leu 


Leu 


Gly 


Arg 


Ala 
155 


Thr 


Pro 


Glu 


Arg 


Val 











715 


Arg 


Gin 


Ala 


Pro 


Gly 
730 


Asp 


Asn 


Gin 


Ser 


Asp 
745 


Phe 


Trp 


Leu 


Trp 


Pro 
760 


Ser 


Pro 


Gin 


Ser 


Pro 
775 


Leu 


Gly 


Ser 


Arg 


Leu 
790 


Glu 


Ser 


Arg 


Val 


Ser 
805 


Gin 


Lys 


Pro 


Met 


Pro 
820 


Pro 


Ala 


Ser 


Asn 


Asp 
835 


Thr 


Ser 


Pro 


Gly 


Pro 
850 


Gin 


Thr 


Pro 


Ser 


Tyr 
865 


Arg 


Asn 


Ser 


Ser 


Pro 
880 


Lys 


Thr 


Leu 


Ala 


Pro 
895 


Trp 


Pro 


Pro 


Leu 


Ala 
910 


Leu 


He 


Cys 


Gly 


Pro 
925 


Ser 


Val 


Pro 


Lys 


Gin 
940 


Gly 


Phe 


Ala 


Gly 


Ser 
955 
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Pro 


Phe 


Glu 


Glu 


Gly 
10 


Arg 


Arg 


Lys 


GlU 


Ala 
25 


Pro 


Ser 


Val 


Ser 


Cys 
40 


Tyr 


Gly 


Arg 


Ala 


Cys 
55 


Gly 


Pro 


Gin 


He 


Thr 
70 


Pro 


Gin 


Glu 


Gly 


Arg 
85 


Glu 


Arg 


Gly 


Leu 


Thr 
100 


Leu 


Asp 


Cys 


Leu 


Leu 
115 


Phe 


Cys 


Leu 


Trp 


Glu 
130 


Tyr 


Ala 


Gly 


Leu 


Thr 
145 


Leu 


Leu 


Ala 


Gly Val 










160 


Thr 


His 


His 


Val 


His 
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720 



Ser 


Gin 


Asp 


His 


Gin 










735 


Ala 


Ala 


Pro 


Pro 


Leu 










750 


Glu 


Leu 


Leu 


Gin 


Glu 










765 


Gin 


Glu 


Asp 


Pro 


Asp 










780 


Glu 


Gin 


Leu 


Gin 


Ala 










795 


Ser 


Asp 


Leu 


Ser 


Arg 










810 


Gin 


Gly 


His 


Ala 


Ser 










825 


Leu 


Ala 


Leu 


Val 


Pro 










840 


Arg 


Leu 


Pro 


Gin 


Gly 










855 


Gly 


Asp 


Leu 


Asp 


Asp 










870 


Arg 


Met 


Pro 


His 


Leu 










885 


Ser 


Ser 


GlU 


Gin 


Glu 










900 


Ser 


Pro 


Leu 


His 


Pro 










915 


Cys 


Phe 


Ser 


Ser 


Leu 










930 


Leu 


Asp 


Phe 


Gin 


Arg 










945 


Trp 


Gly 


His 






Asn 


Gly 


Gly 


Leu 


Arg 










15 


Trp 


Arg 


Arg 


Asp 


Cys 










30 


Gly 


Gly 


Trp 


Gly 


His 










45 


Gly 


Val 


Lys 


Glu 


Lys 










60 


Val 


Tyr 


Ala 


Val 


Trp 










75 


Leu 


Arg 


Val 


Asn 


Ser 










90 


Asn 


Ala 


Cys 


Thr 


Asn 










105 


Phe 


GlU 


Asn 


Val 


Asn 










120 


Lys 


Thr 


Thr 


He 


Val 










135 


Leu 


Gin 


Ser 


Lys 


Phe 










150 


His 


Val 


He 


Thr 


Leu 










165 


Gly 


Trp 


Tyr 


Met 


Glu 
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170 










175 










180 


Asp 


Gly 


Phe 


Lys 


Gly Asp Arg 


Thr 


Glu 


Gly 


Cys 


Arg 


Ser 


Asp 


Ser 










185 










190 










195 


Val 


Ala 


Val 


Pro 


Ala 
200 


Ala 


Ala 


Pro 


Val 


Cys 
205 


Gin 


Pro 


Lys 


Ser 


Ala 
210 


Thr 


Asn Gly Gin 


Pro 


Pro 


Ala 


Pro 


Ala 


Pro 


Thr 


Pro 


Thr 


Pro 


Arg 










215 










220 










225 


Leu 


Ser 


He 


Ser 


Ser 


Arg 


Ala 


Thr 


Val 


Val 


Ala Arg Met 


Glu 


Gly 










230 










235 










240 


Thr 


Ser Gin Gly Gly 


Leu 


Gin 


Thr 


Val 


Met 


Lys 


Trp 


Lys 


Thr 


Val 










245 










250 










255 


Val 


Ala 


He 


Phe 


Val 


Val 


Val 


Val 


Val 


Tyr 


Leu 


Val 


Thr 


Gly Gly 










260 










265 










270 


Leu 


Val 


Phe Arg 


Ala 


Leu 


Glu 


Gin 


Pro 


Phe 


Glu 


Ser 


Ser 


Gin 


Lys 










275 










280 










285 


Asn 


Thr 


He 


Ala 


Leu 


Glu 


Lys 


Ala 


Glu 


Phe 


Leu Arg 


Asp 


His 


Val 










290 










295 










300 


Cys 


Val 


Ser 


Pro 


Gin 


Glu 


Leu 


Glu 


Thr 


Leu 


He 


Gin 


His 


Ala 


Leu 








305 










310 










315 


Asp 


Ala 


Asp 


Asn 


Ala 


Gly Val 


Ser 


Pro 


He 


Gly Asn 


Ser 


Ser 


Asn 








320 










325 










330 


Asn 


Ser 


Ser 


His 


Trp 


Asp 


Leu Gly 


Ser 


Ala 


Phe 


Phe 


Phe 


Ala 


Gly 










335 








340 










345 


Thr 


Val 


He 


Thr 


Thr 


Met 


Tyr 


Gly Asn 


He 


Ala 


Pro 


Ser 


Thr 


Glu 










350 








355 










360 


Gly 


Gly Lys 


He 


Phe 


Cys 


He 


Leu 


Tyr 


Ala 


He 


Phe 


Gly 


He 


Pro 








365 










370 










375 


Leu 


Phe 


Gly 


Phe 


Leu 


Leu Ala Gly 


He 


Gly Asp 


Gin 


Leu Gly Thr 








380 










385 










390 


He 


Phe 


Gly 


Lys 


Ser 
395 


He 


Ala 


Arg 


Val 


Glu 
400 


Lys 


Val 


Phe 


Arg 


Lys 
405 


Lys 


Gin 


Val 


Ser 


Gin 


Thr 


Lys 


He 


Arg 


Val 


He 


Ser 


Thr 


He 


Leu 








410 










415 










420 


Phe 


He 


Leu 


Ala 


Gly 
425 


Cys 


He 


Val 


Phe 


Val 
430 


Thr 


He 


Pro 


Ala 


Val 
435 


He 


Phe 


Lys 


Tyr 


He 
440 


Glu 


Gly 


Trp 


Thr 


Ala 
445 


Leu 


Glu 


Ser 


He 


Tyr 
450 


Phe 


Val 


Val 


Val 


Thr 


Leu 


Thr 


Thr 


Val 


Gly 


Phe 


Gly Asp 


Phe 


Val 










455 










460 










465 


Ala 


Val 


Val 


Val 


Phe 
470 


Arg 


Gly 


Asn 


Ala 


Gly 
475 


He 


Asn 


Tyr 


Arg 


Glu 
480 


Trp 


Tyr 


Lys 


Pro 


Leu 


Val 


Trp 


Phe 


Trp 


He 


Leu 


Val 


Gly 


Leu 


Ala 








485 










490 










495 


Tyr 


Phe 


Ala 


Ala 


Val 


Leu 


Ser 


Met 


He 


Gly Asp 


Trp 


Leu Arg 


Val 








500 










505 










510 


Leu 


Ser 


Lys 


Lys 


Thr 


Lys 


Glu 


GlU 


Val 


Gly Glu 


He 


Lys 


Ala 


His 






515 










520 










525 


Ala 


Ala 


Glu 


Trp 


Lys 


Ala 


Asn 


Val 


Thr 


Ala 


Glu 


Phe 


Arg 


Glu 


Thr 








53 0 










535 










540 


Arg 


Arg Arg 


Leu 


Ser 


Val 


Glu 


He 


His 


Asp 


Lys 


Leu 


Gin Arg Ala 








545 










550 










555 


Ala 


Thr 


He Arg 


Ser 


Met 


Glu 


Arg 


Arg Arg 


Leu Gly Leu Asp 


Gin 










560 










565 










570 


Arg 


Ala 


His 


Ser 


Leu 


Asp 


Met 


Leu 


Ser 


Pro 


Glu 


Lys 


Arg 


Ser 


Val 








575 










580 










585 


Phe 


Ala 


Ala 


Leu 


Asp 


Thr 


Gly Arg 


Phe 


Lys 


Ala 


Ser 


Ser 


Gin 


Glu 










590 










595 










600 


Ser 


He 


Asn 


Asn 


Arg 
605 


Pro 


Asn 


Asn 


Leu 


Arg 
610 


Leu 


Lys 


Gly 


Pro 


Glu 
615 


Gin 


Leu 


Asn 


Lys 


His 


Gly Gin Gly Ala 


Ser 


Glu 


Asp 


Asn 


He 


He 










620 










625 










630 


Asn 


•Lys 


Phe 


Gly 


Ser 


Thr 


Ser 


Arg 


Leu 


Thr 


Lys 


Arg 


Lys 


Asn 


Lys 






63 5 










640 










645 


Asp 


Leu 


Lys 


Lys 


Thr 


Leu 


Pro 


Glu 


Asp 


Val 


Gin 


Lys 


He 


Tyr 


Lys 








650 










655 










660 


Thr 


Phe 


Arg 


Asn 


Tyr 
665 


Ser 


Leu 


Asp 


Glu 


Glu 
670 


Lys 


Lys 


Glu 


Glu 


Glu 
675 
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Thr 


Glu 


Lys 


Met 


Cys 


Asn 


Ser 


Asp 


Asn 


Ser 


Ser Thr Ala Met 


Leu 










680 










685 




690 


Thr 


Asp 


Cys 


lis 


Gin 


Gin 


His 


Ala 


Glu 


Leu 


Glu Asn Gly Met 


lie 










695 










700 


705 


Pro 


Thr 


Asp 


Thr 


Lys 


Asp 


Arg 


Glu 


Pro 


Glu 


Asn Asn Ser Leu 


Leu 










710 










715 




720 


Glu 


Asp 


Arg 


Asn 
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Met 


Tyr 


Asn 


Glu 


He 


Leu 


Met 


Leu 


Gly Ala 


Lys 


Leu 


His 


Pro 


Thr 


1 








5 










10 








15 


Leu 


Lys 


Leu 


Glu 


Glu 


Leu 


Thr 


Asn 


Lys 


Lys 


Gly Met 


Thr 


Pro 


Leu 










20 










25 










30 


Ala 


Leu 


Ala 


Ala 


Gly 


Thr 


Gly 


Lys 


He 


Gly 


Asn Arg 


His 


Asp 


Met 










35 
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170 








Lys 


Gly Met 


His 


Ala 


Tyr 


Arg 


Lys 










185 








He 


Ala 


Val 


Lys 


Val 


Pro 


Ser 


Val 










200 








Val 


Leu 


Asn 


Pro 


Phe 


Tyr 


He 


Phe 










215 








Trp 


Ser 


Thr 


Asp 


Glu 


Tyr 


Tyr 


Tyr 










230 








Met 


Ser 


He 


Val 


Ser 


He 


Val 


Ser 










245 








Gin 


Tyr 


Val 


Met 


Leu 


His 


Asp 


Met 










260 








Arg 


Val 


Ser 


Val 


Cys 


Arg Val 


Asn 










275 








Ser 


Thr 


Asp 


Leu 


Val 


Pro 


Gly 


Asp 










290 








Gly 


Thr 


He 


Met 


Pro 


Cys 


Asp 


Ala 










305 






He 


Val 


Asn 


Glu 


Ser 


Met 


Leu 


Thr 



320 



Thr 


Ser 


Pro 


Gin 


Ala 


Tyr 


Gly 




565 










570 


Glu 


Cys 


He 


Val 


Asn 


Gin 


Asp 




580 










585 


Gin 


Ala 


Ser 


Ser 


Pro 


Cys 


Ser 




595 










600 


Gin 


Lys 


Ala 


Ser 


Pro 


Gin 


Thr 




610 










615 


Gly 


Asp 


Leu 


Gly 


Pro 


Gly 


Arg 




625 










630 



He 


Asn 


Gin 


Gly Gin 


Glu 


Asp 




10 










15 


Leu 


Ser 


Arg 


Trp 


Lys 


Leu 


Ala 




25 










30 


Ser 


Gly Gly Val 


Ser 


Pro 


Pro 




40 










45 


Arg 


Val 


Lys 


Ala 


Thr 


Cys 


Val 




55 










60 


Val 


Val 


Leu 


Leu 


Arg 


Thr 


Thr 




70 










75 


Ala 


Lys 


He 


Arg 


Val 


Leu 


Ser 




85 










90 


Pro 


Lys 


Ser 


Met 


Ser 


Asn 


Lys 




100 










105 


Leu 


He 


Glu 


Asn 


Pro 


Thr 


Glu 




115 










120 


Tyr 


Ser 


Gin 


Thr 


Glu 


Ser 


Gin 




13 0 










135 


Ser 


Val 


Lys 


Tyr 


Phe 


Trp 


Asn 




145 










150 


Leu 


Lys 


Gly 


Leu 


Asp 


Glu 


Gly 




160 










165 


Lys 


His 


Ser 


Ala 


Gly 


Leu 


Thr 




175 










180 


Leu 


Leu 


Tyr 


Gly Val 


Asn 


Glu 




190 










195 


Phe 


Lys 


Leu 


Leu 


He 


Lys 


Glu 




205 










210 


Gin 


Leu 


Phe 


Ser 


Val 


He 


Leu 




220 










225 


Tyr 


Ala 


Leu 


Ala 


He 


Val 


Val 




235 










240 


Ser 


Leu 


Tyr 


Ser 


He Arg 


Lys 




250 










255 


Val 


Ala 


Thr 


His 


Ser 


Thr 


Val 




265 










270 


Glu 


Glu 


He 


Glu 


Glu 


He 


Phe 




280 










285 


Val 


Met 


Val 


He 


Pro 


Leu 


Asn 




295 










300 


Val 


Leu 


He 


Asn 


Gly Thr 


Cys 




310 










315 


Gly 


Glu 


Ser 


Val 


Pro 


Val 


Thr 


325 










330 
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Lys 


Thr 


Asn 


Leu 


Pro 


Asn 


Pro 


Ser 


Val 


Asp 


Val 


Lys 


Gly 


He 


Gly 










335 










340 










345 


Asp 


Glu 


Leu Tyr 


Asn 


Pro 


Glu 


Thr 


His 


Lys 


Arg 


His 


Thr 


Leu 


Phe 










350 










355 










360 


Cys 


Gly 


Thr 


Thr 


Val 


He 


Gin 


Thr 


Arg 


Phe 


Tyr 


Thr 


Gly 


Glu 


Leu 








365 










370 










375 


Val 


Lys 


Ala 


He 


Val 


Val 


Arg 


Thr 


Gly 


Phe 


Ser 


Thr 


Ser 


Lys 


Gly 










380 










385 










390 


Gin 


Leu 


Val 


Arg 


Ser 


He 


Leu 


Tyr 


Pro 


Lys 


Pro 


Thr 


Asp 


Phe 


Lys 










395 










400 










405 


Leu 


Tyr 


Arg 


Asp 


Ala 


Tyr 


Leu 


Phe 


Leu 


Leu 


Cys 


Leu 


Val 


Ala 


Val 










410 










415 










420 


Ala 


Gly 


He 


Gly 


Phe 


He 


Tyr 


Thr 


He 


He 


Asn 


Ser 


He 


Leu 


Asn 










425 










430 










435 


Glu 


Val 


Gin 


Val 


Gly 


Val 


He 


He 


He 


Glu 


Ser 


Leu 


Asp 


He 


He 










440 










445 










450 


Thr 


He 


Thr 


Val 


Pro 


Pro 


Ala 


Leu 


Pro 


Ala 


Ala 


Met 


Thr 


Ala 


Gly 










455 










460 










465 


He 


Val 


Tyr 


Ala 


Gin 


Arg 


Arg 


Leu 


Lys 


Lys 


He 


Gly 


He 


Phe 


Cys 










470 










475 










480 


He 


Ser 


Pro 


Gin 


Arg 


He 


Asn 


He 


Cys 


Gly 


Gin 


Leu 


Asn 


Leu 


Val 










485 










490 










495 


Cys 


Phe 


Asp 


Lys 


Thr 


Gly 


Thr 


Leu 


Thr 


Glu 


Asp 


Gly 


Leu 


Asp 


Leu 










500 










505 










510 


Trp 


Gly 


He 


Gin 


Arg 


Val 


Glu 


Asn 


Ala 


Arg 


Phe 


Leu 


Ser 


Pro 


Glu 








515 










520 










525 


Glu 


Asn 


Val 


Cys 


Asn 


Glu 


Met 


Leu 


Val 


Lys 


Ser 


Gin 


Phe 


Val 


Ala 








530 










535 










540 


Cys 


Met 


Ala 


Thr 


Cys 


His 


Ser 


Leu 


Thr 


Lys 


He 


Glu 


Gly 


Val 


Leu 










545 










550 










555 


Ser 


Gly 


Asp 


Pro 


Leu 


Asp 


Leu 


Lys 


Met 


Phe 


Glu 


Ala 


He 


Gly 


Trp 










560 










565 










570 


He 


Leu 


Glu 


Glu 


Ala 


Thr 


Glu 


Glu 


Glu 


Thr 


Ala 


Leu 


His 


Asn 


Arg 










575 










580 










585 


He 


Met 


Pro 


Thr 


Val 


Val 


Arg 


Pro 


Pro 


Lys 


Gin 


Leu 


Leu 


Pro 


Glu 










5£0 








595 










600 


Ser 


Thr 


Pro 


Ala 


Gly 


Asn 


Gin 


Glu 


Met 


Glu 


Leu 


Phe 


Glu 


Leu 


Pro 










605 










610 










615 


Ala 


Thr 


Tyr Glu 


He 


Gly 


He 


Val 


Arg 


Gin 


Phe 


Pro 


Phe 


Ser 


Ser 










620 










625 










63 0 


Ala 


Leu 


Gin Arg 


Met 


Ser 


Val 


Val 


Ala 


Arg 


Val 


Leu 


Gly 


Asp 


Arg 










635 










640 










645 


Lys 


Met 


Asp Ala 


Tyr 


Met 


Lys 


Gly 


Ala 


Pro 


Glu 


Ala 


He 


Ala 


Gly 










650 










655 










660 


Leu 


Cys 


Lys 


Pro 


Glu 


Thr 


Val 


Pro 


Val 


Asp 


Phe 


Gin 


Asn 


Val 


Leu 






665 










670 










675 


Glu 


Asp 


Phe 


Thr 


Lys 


Gin 


Gly 


Phe 


Arg 


Val 


He 


Ala 


Leu 


Ala 


His 










680 










685 










690 


Arg 


Lys 


Leu 


Glu 


Ser 


Lys 


Leu 


Thr 


Trp 


His 


Lys 


Val 


Gin 


Asn 


He 










695 










700 










705 


Ser 


Arg 


Asp 


Ala 


He 


Glu 


Asn 


Asn 


Met 


Asp 


Phe 


Met 


Gly 


Leu 


He 










710 










715 










720 


He 


Met 


Gin 


Asn 


Lys 


Leu 


Lys 


Gin 


Lys 


Thr 


Pro 


Ala 


Val 


Leu 


Glu 










725 










730 










735 


Asp 


Leu 


His 


Lys 


Ala 


Asn 


He 


Arg 


Thr 


Val 


Met 


Val 


Thr 


Gly 


Asp 








740 










745 










750 


Ser 


Met 


Leu 


Thr 


Ala 


Val 


Ser 


Val 


Ala 


Arg 


Asp 


Cys 


Gly 


Met 


He 










755 










760 










765 


Leu 


Pro 


Gin Asp 


Lys 


Val 


He 


He 


Ala 


Glu 


Ala 


Leu 


Pro 


Pro 


Lys 










770 










775 










780 


Asp 


Gly 


Lys 


Val 


Ala 


Lys 


He 


Asn 


Trp 


His 


Tyr 


Ala 


Asp 


Ser 


Leu 










785 










790 










795 


Thr 


Gin 


Cys 


Ser 


His 


Pro 


Ser 


Ala 


He 


Asp 


Pro 


Glu 


Ala 


He 


Pro 








800 










805 










810 


Val 


Lys 


Leu 


Val 


His 


Asp 


Ser 


Leu 


Glu 


Asp 


Leu 


Gin 


Met 


Thr 


Arg 










815 










820 










825 


Tyr 


His 


Phe 


Ala 


Met 


Asn 


Gly 


Lys 


Ser 


Phe 


Ser 


Val 


He 


Leu 


Glu 
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830 



His 


Phe 


Gin 


Asp Leu 
845 


Val 


Pro 


Lys 


Phe 


Ala 


Arg 


Met Ala 
860 


Pro 


Asp 


Gin 


Leu 


Gin 


Asn 


Val Asp 
875 


Tyr 


Phe 


Val 


Asn 


Asp 


Cys 


Gly Ala 
890 


Leu 


Lys 


Arg 


Ser 


Glu 


Leu 


Glu Ala 
905 


Ser 


Val 


Ala 


Pro 


Ser 


He 


Ser Cys 
920 


Val 


Pro 


Asn 


Ala 


Leu 


He 


Thr Ser 
935 


Phe 


Cys 


Val 


Ser 


lie 


He 


Gin Tyr 
950 


Phe 


Ser 


Val 


Ser 


Asn 


Leu 


Gly Asp 
965 


Phe 


Gin 


Phe 


lie 


Leu 


Val 


Val Val 
980 


Phe 


Thr 


Met 


GlU 


Leu 


Val 


Ala Gin 


Arg 


Pro 


Pro 








995 






Leu 


Leu 


Phe 


Ser Val 
1010 


Leu 


Ser 


Gin 


Gin 


Ser 


Leu 


Gly Phe 
1025 


Phe 


Trp 


Val 


Val 


Trp 


His 


Pro Lys 
1040 


Ser 


Asp 


Ala 


Phe 


Trp 


Asn 


Ser Ser 


His 


Val 


Asp 








1055 






His 


Asn 


lie 


Gin Asn 
1070 


Tyr 


Glu 


Asn 


Ser 


Phe 


Gin 


Tyr Leu 
1085 


He 


Val 


Ala 


Pro 


Phe 


Arg 


Gin Pro 
1100 


Cys 


Tyr 


Lys 


Val 


lie 


Phe 


Leu Tyr 
1115 


He 


Phe 


He 


Val 


Ala 


Ser 


Val Asp 
1130 


Gin 


Val 


Leu 


Gin 


Trp 


Arg 


Val Thr 
1145 


Met 


Leu 


He 


Val 


Ser 


He 


Thr Val 
1160 


Glu 


Asn 


Phe 


Lys 


Val 


Val 


Phe Asn 
1175 


Arg 


Asp 


Lys 


Thr 


Thr 


Gin 


Pro Pro 
1190 


Gin 


Glu 


Ser 


Cys 


Leu 


Pro 


Trp Ala 


Leu 


Gly Cys 








1205 








Lys 


Tyr 


Met 


Tyr Leu 
1220 


Ala 


Gin 


Glu 


Pro 


Pro 


Lys 


Pro Gin 
1235 


Thr 


Thr 


Thr 


GlU 


Asn 


Gly 


Ser Cys 


Gin 


He 


He 
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835 








840 


Leu Met 


Leu 


His 


Gly 


Thr Val 


850 






855 


Lys Thr 


Gin 


Leu 


He 


Glu Ala 


865 








870 


Gly Met 


Cys 


Gly 


Asp 


Gly Ala 


880 








885 


Ala His 


Gly 


Gly 


He 


Ser Leu 


895 








900 


Ser Pro 


Phe 


Thr 


Ser 


Lys Thr 


910 








915 


Leu He 


Arg 


Glu 


Gly 


Arg Ala 


925 








930 


Phe Lys 


Phe 


Met 


Ala 


Leu Tyr 


940 








945 


Thr Leu 


Leu 


Tyr 


Ser 


He Leu 


955 








960 


Leu Phe 


He 


Asp 


Leu 


Ala He 


970 








975 


Ser Leu 


Asn 


Pro 


Ala 


Trp Lys 


985 








990 


Ser Gly 


Leu 


He 


Ser 


Gly Ala 


1000 








1005 


He He 


He 


Cys 


He 


Gly Phe 


1015 








1020 


Lys Gin 


Gin 


Pro 


Trp 


Tyr Glu 


1030 








1035 


Cys Asn 


Thr 


Thr 


Gly 


Ser Gly 


1045 








1050 


Asn Glu 


Thr 


Glu 


Leu 


Asp Glu 


1060 








1065 


Thr Thr 


Val 


Phe 


Phe 


lie Ser 


1075 








1080 


He Ala 


Phe 


Ser 


Lys 


Gly Lys 


1090 








1095 


Asn Tyr 


Phe 


Phe 


Val 


Phe Ser 


1105 








1110 


Leu Phe 


He 


Met 


Leu 


Tyr Pro 


1120 








1125 


Gin He 


Val 


Cys 


Val 


Pro Tyr 


1135 








1140 


He Val 


Leu 


Val 


Asn 


Ala Phe 


1150 








1155 


Phe Leu 


Asp 


Met 


Val 


Leu Trp 


1165 








1170 


Gin Gly 


Glu 


Tyr 


Arg 


Phe Ser 


1180 








1185 


Val Asp 


Arg 


Trp 


Gly 


Lys Cys 


1195 








1200 


Arg Lys 


Lys 


Thr 


Pro 


Lys Ala 


1210 








1215 


Leu Leu 


Val 


Asp 


Pro 


Glu Trp 


1225 








1230 


Glu Ala 


Lys 


Ala 


Leu 


Val Lys 


1240 








1245 


Thr He 


Thr 








1255 
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<400> 15 
Met Leu His 
1 

Leu Leu Leu 
Phe Gly Tyr 
Gin Glu Phe 
Leu Pro Asp 
Leu Tyr Pro 
Leu Ala lie 
He Phe Val 
Ala Gly Ser 
Val Asn Ala 
Glu Ser Ala 
Ala He Phe 
Leu Arg Glu 
Ala Ser Cys 
Leu Leu Pro 
Thr Glu Ala 
Asp Leu Ala 
Cys Gin Gly 
Arg Ala Leu 
Ala Met Glu 
Ser Val Phe 
Ala He He 
Ser Cys Val 
Gly Gly Tyr 
Ala Leu Cys 
Met Ala Cys 
Ala Gly Val 
Ala Arg Pro 
Met Leu He 
Leu Ser His 
Gly Ala He 
Thr Phe Gin 
Arg Arg Ala 



Ala 


Leu 


Leu 




5 




Thr 


He 


Cys 




20 




Asn 


Leu 


Ser 




35 




Thr 


Asn 


Glu 




50 




His 


Leu 


Val 




65 




Leu 


Gly 


Gly 




80 




Thr 


Leu 


Gly 




95 




Val 


Ser 


Ala 




110 




Phe 


Glu 


Met 




125 




Gly 


Val 


Ser 




140 




Pro 


Lys 


Glu 




155 




Thr 


Ala 


Leu 




170 




Leu 


Leu 


Gly 




185 




Leu 


Val 


Pro 




200 




Glu 


Ser 


Pro 




215 




Cys 


Leu 


Ala 




230 




Gly 


Glu 


Leu 




245 




Cys 


Arg 


Ala 




260 




Arg 


Arg 


Gin 




275 




Leu 


Cys 


Gly 




290 




Arg 


Lys 


Ala 




305 




Gly 


Thr 


Gly 




320 




Val 


He 


Glu 




335 




Ser 


Leu 


Met 




350 




Leu 


Gin 


Ser 




365 




He 


Phe 


Ala 




380 




Thr 


Gly 


He 




395 




Ala 


Ala 


Cys 




410 




Leu 


Val 


Gly 




425 




Phe 


Leu 


Tyr 




440 




Tyr 


Thr 


Gly 




455 




Glu 


He 


Ser 




470 




Gin 


Gly 


Pro 




485 





Arg 


Ser 


Arg 


Aia 




taiy 


i le 


lie 


Asn 


Thr 


Trp 


pin 


Leu 


Leu 


Mec 


Leu 


riie 


c^iy 


Arg 


Lys 


Lys 




±xe 


Leu 


i ie 




Leu 


Men 


Asn 


T 1 /=» 

lie 


Leu 


Arg 


L*iy 


L?iy 


x±e 


val 


Gly 


Pro 


Gin 


Gly 


Ala 


Leu 


Arg 


Tyr 


Leu 


Ala 


Leu 


Arg 


Glu 


/"•ill 


Leu 


Arg 


Arg 


Pro 


vai 


rr>V» -y~ 

mr 


Ser 


Asn 


Asp 


S er 




val 


Pro 


Ser 


Cys 


talU 


Arg 


val 


Giiy 


Thr 


Cys 


Trp 


Ser 


rue 


Pro 


ir lie 


± x e 


Leu 


Leu 




inr 


1X1 eu 


va J_ 


Cys 


Leu 




±rllt= 


Val 


Pro 


Phe 


Leu 


Phe 


Leu 


Lys 


Glu 


Leu 


Thr 


Trp 


Arg 



Thr He Gin 
10 

He Gly Gly 
25 

Ala Pro Thr 
40 

Ala Arg Thr 
55 

Trp Ser Leu 
70 

Ala Leu Leu 
85 

Ser Leu Leu 
100 

Phe Gly Phe 
115 

Gly Arg Leu 
130 

Gin Pro Met 
145 

Ala Val Ala 
160 

Met Gly Gin 
175 

Ala Trp Pro 
190 

Gin Leu Ala 
205 

Leu He Asp 

220 

Gin Leu Arg 

235 

Glu Glu Glu 
250 

Trp Glu Leu 
265 

Leu Val Val 
280 

Val Tyr Ala 
295 

Glu Ala Lys 
310 

Leu Leu Thr 
325 

Arg Arg Val 
340 

Gly Ser He 
355 

Trp Thr Leu 
370 

Ser Phe Gly 
385 

Glu Leu Phe 
400 

Gly Ala Leu 
415 

Pro Phe He 
430 

Leu Gly Val 
445 

Pro Glu Thr 
460 

His Arg Leu 
475 

Ser Leu Glu 
490 



Gly Arg He 
15 

Thr Phe Gin 
30 

Leu His He 
45 

Gly Glu Pro 
60 

He Val Ser 
75 

Ala Gly Pro 
90 

Val Asn Asn 
105 

Ser Arg Lys 
120 

Leu Val Gly 
135 

Tyr Leu Gly 
150 

Met Ser Ser 
165 

Val Val Gly 
180 

Leu Leu Leu 
195 

Ser Leu Pro 
210 

Cys Gly Asp 
225 

Gly Ser Gly 
240 

Arg Ala Ala 
255 

Phe Gin His 
270 

Leu Gly Ser 
285 

Tyr Ala Ser 
300 

He Gin Tyr 
315 

Ala Val Val 
330 

Leu Leu He 
345 

Phe Thr Val 
360 

Tyr Leu Ala 
375 

He Gly Pro 
390 

Asp Gin Met 
405 

Met Trp He 
420 

Met Glu Ala 
435 

Cys Val Cys 
450 

Lys Gly Lys 
465 

Asn Phe Pro 
480 

Val He Gin 
495 
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Ser Thr Glu Leu 
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<220> 
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<400> 16 


















Met 


Ala 


Ala 


Asn 


Ser 


Thr 


Ser Asp 


Leu 


His 


Thr Pro Gly Thr 


Gin 


1 








5 






10 




15 


Leu 


Ser 


Val 


Ala 


Asp 


He 


He Val 


He 


Thr 


Val Tyr Phe Ala 


Leu 










20 








25 




30 


Asn 


Val 


Ala 


Val 


Gly 


He 


Trp Ser 


Ser 


Cys 


Arg Ala Ser Arg 


Asn 










35 








40 




45 


Thr 


Val 


Asn 


Gly 


Tyr 


Phe 


Leu Ala Gly Arg 


Asp Met Thr Trp 


Trp 










50 








55 




60 


Pro 


He 


Gly 


Ala 


Ser 


Leu 


Phe Ala 


Ser 


Ser 


Glu Gly Ser Gly 


Leu 








65 








70 




75 


Phe 


He 


Gly 


Leu 


Ala 


Gly 


Ser Gly Ala 


Ala 


Gly Gly Leu Ala 


Val 










80 








85 




90 


Ala 


Gly 


Phe 


Glu 


Trp 


Asn 


Ala Thr 


Tyr 


Val 


Leu Leu Ala Leu 


Ala 










95 








100 




105 


Trp 


Val 


Phe 


Val 


Pro 


He 


Tyr He 


Ser 


Ser 


Glu He Val Thr 


Leu 










110 






115 




120 


Pro 


Glu 


Tyr 


He 


Gin 


Lys 


Arg Tyr 


Gly Gly Gin Arg He Arg 


Met 










125 








130 




135 


Tyr 


Leu 


Ser 


Val 


Leu 


Ser 


Leu Leu 


Leu 


Ser 


Val Phe Thr Lys 


He 








140 








145 




150 


Ser 


Leu 


Asp 


Leu 


Tyr 


Ala 


Gly Ala 


Leu 


Phe 


Val His He Cys 


Leu 










155 








160 




165 


Gly 


Trp 


Asn 


Phe 


Tyr 


Leu 


Ser Thr 


He 


Leu 


Thr Leu Gly He 


Thr 










170 








175 




180 


Ala 


Leu 


Tyr 


Thr 


He 


Ala 


Gly Gly 


Leu 


Ala 


Ala Val He Tyr 


Thr 










185 








190 




195 


Asp 


Ala 


Leu 


Gin 


Thr 


Leu 


He Met 


Val 


Val 


Gly Ala Val He 


Leu 










200 








205 




210 


Thr 


He 


Lys 


Ala 


Phe 


Asp 


Gin He 


Gly 


Gly 


Tyr Gly Gin Leu 


Glu 










215 








220 




225 


Ala 


Ala 


Tyr 


Ala 


Gin 


Ala 


He Pro 


Ser 


Arg 


Thr He Ala Asn 


Thr 








230 








235 




240 


Thr 


Cys 


His 


Leu 


Pro 


Arg 


Thr Asp 


Ala 


Met 


His Met Phe Arg 


Asp 










245 








250 




255 


Pro 


His 


Thr 


Gly Asp 


Leu 


Pro Trp 


Thr 


Gly Met Thr Phe Gly 


Leu 










260 








265 




270 


Thr 


He 


Met 


Ala 


Thr 


Trp 


Tyr Trp 


Cys 


Thr 


Asp Gin Val He 


Val 










275 








280 




285 


Gin 


Arg 


Ser 


Leu 


Ser 


Ala 


Arg Asp 


Leu 


Asn 


His Ala Lys Ala 


Gly 










290 








295 




300 


Ser 


He 


Leu 


Ala 


Ser 


Tyr 


Leu Lys 


Met 


Leu 


Pro Met Gly Leu 


He 










305 






310 




315 


He 


Met 


Pro 


Gly 


Met 


He 


Ser Arg 


Ala 


Leu 


Phe Pro Asp Asp 


Val 










320 








325 




330 


Gly 


Cys 


Val 


Val 


Pro 


Ser 


Glu Cys 


Leu 


Arg 


Ala Cys Gly Ala 


Glu 










335 








340 




345 


Val 


Gly 


Cys 


Ser 


Asn 


He 


Ala Tyr 


Pro 


Lys 


Leu Val Met Glu 


Leu 










350 








355 




360 


Met 


Pro 


He 


Gly 


Leu 


Arg 


Gly Leu 


Met 


He 


Ala Val Met Leu 


Ala 








365 






370 




375 


Ala 


Leu 


Met 


Ser 


Ser 


Leu 


Thr Ser 


He 


Phe 


Asn Ser Ser Ser 


Thr 










380 








385 




390 


Leu 


Phe 


Thr 


Met 


Asp 


He 


Trp Arg 


Arg 


Leu 


Arg Pro Arg Ser 


Gly 










395 








400 




405 
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Glu 


Arg 


Glu 


Leu 


Leu 
410 


Leu 


Val 


Gly 


lie 


Gly 


Val 


Ser 


Val 
425 


Ala 


Trp 


He 


Ser 


Gly 


Gin 


Leu 


Phe 
440 


He 


Tyr 


Met 


Ala 


Pro 


Pro 


Val 


Thr 
455 


Ala 


Val 


Phe 


Arg 


Ala 


Asn 


Glu 


Gin 


Gly 


Ala 


Phe 










470 






Val 


Val 


Gly Ala 


Thr 


Arg 


Leu 


Val 










485 








Pro 


Pro 


Cys 


Gly 


Glu 
500 


Pro 


Asp 


Thr 


He 


His 


Tyr 


Leu 


His 
515 


Phe 


Ala 


Val 


Ala 


Val 


Val 


Val 


Ala 
530 


Gly 


Ser 


Leu 


Val 


Gin 


He 


Glu 


Asn 
545 


Leu 


Thr 


Trp 


Pro 


Leu 


Gly Thr 


Lys 


Ala 


Gly 


Asp 










560 








Ala 


Phe 


Trp 


Ala 


Arg 
575 


Val 


Cys 


Gly 


Cys 


Val 


Asn 


He 


Phe 


Phe 


Tyr 


Ala 
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Met 


Asp 


Cys 


Ser 


Leu 


Val 


Arg 


Thr 


1 








5 








Gly 


Glu 


Glu 


Asn 


Trp 


Val 


Asp 


Ser 










20 








Arg 


Glu 


Pro 


Pro 


Pro 


Gly 


Ala 


Glu 










35 








Pro 


Asp 


Asn Arg 


He 


Val 


Ser 


Ser 










50 








He 


Pro 


Lys 


Asn 


Leu 


Phe 


Glu 


Gin 








65 








Tyr 


Phe 


Leu 


He 


He 


Phe 


Leu 


Val 










80 








Thr 


Ser 


Pro 


Val 


Thr 


Ser 


Gly 


Leu 










95 








Val 


Thr 


Ala 


He 


Lys 


Gin 


Gly 


Tyr 










110 








Ala 


Asp Asn Ala 


Met 


Asn 


Gin 


Cys 










125 








Gly 


Lys 


Leu 


Val 


Arg 


Lys 


Gin 


Ser 










140 








He 


Val 


Met 


Val 


Lys 


Glu 


Asp 


Glu 










155 








Phe 


Leu 


Ser 


Ser 


Asn 


Arg 


Gly 


Asp 










170 








Ala 


Ser 


Leu 


Asp 


Gly 


Glu 


Ser 


Ser 










185 








Gin 


Asp 


Thr 


Lys 


Gly 


Phe 


His 


Thr 










200 








His 


Ala 


Thr 


He 


Glu 


Cys 


Glu 


Gin 



215 



Arg 


Leu 


Val 


He 


Val 


Ala 


Leu 




415 










420 


Pro 


Val 


Leu 


Gin 


Asp 


Ser 


Asn 




430 










435 


Gin 


Ser 


Val 


Thr 


Ser 


Ser 


Leu 




445 










450 


Val 


Leu 


Gly 


Val 


Phe 


Trp 


Arg 




460 










465 


Trp Gly 


Leu 


He Ala Gly Leu 




475 










480 


Leu 


Glu 


Phe 


Leu 


Asn 


Pro 


Ala 




490 










495 


Arg 


Pro 


Ala 


Val 


Leu Gly 


Ser 




505 










510 


Ala 


Leu 


Phe 


Ala 


Leu 


Ser 


Gly 




520 










525 


Leu 


Thr 


Pro 


Pro 


Pro 


Gin 


Ser 




535 










540 


Trp 


Thr 


Leu 


Ala 


Gin 


Asp 


Val 




550 










555 


Gly Gin 


Thr 


Pro 


Gin 


Lys 


His 




565 










570 


Phe 


Asn 


Ala 


He 


Leu 


Leu 


Met 




580 










585 


Tyr 


Phe 


Ala 












595 












Leu 


Val 


His 


Arg 


Tyr 


Cys 


Ala 




10 










15 


Arg 


Thr 


He 


Tyr 


Val 


Gly 


His 




25 










30 


Ala 


Tyr 


He 


Pro 


Gin 


Arg 


Tyr 




40 










45 


Lys 


Tyr 


Thr 


Phe 


Trp 


Asn 


Phe 




55 










60 


Phe 


Arg 


Arg 


Val 


Ala 


Asn 


Phe 




70 










75 


Gin 


Leu 


He 


He 


Asp 


Thr 


Pro 




85 










90 


Pro 


Leu 


Phe 


Phe 


Val 


He 


Thr 




100 










105 


Glu 


Asp 


Trp 


Leu Arg 


His 


Lys 




115 










12 0 


Pro 


Val 


His 


Phe 


He 


Gin 


His 




130 










135 


Arg 


Lys 


Leu 


Arg 


Val 


Gly 


Asp 




145 










150 


Thr 


Phe 


Pro 


Cys 


Asp 


Leu 


He 




160 










165 


Gly 


Thr 


Cys 


His 


Val 


Thr 


Thr 




175 










180 


His 


Lys 


Thr 


His 


Tyr 


Ala 


Val 




190 










195 


Glu 


Glu 


Asp 


He Gly Gly 


Leu 




205 










210 


Pro 


Gin 


Pro 


Asp 


Leu 


Tyr 


Lys 




220 










225 
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Phe 


Val 


Gly 


Arg 


He 
230 


Asn 


Val 


Tyr 


Ser 


Asp 
235 


Leu 


Asn 


Asp 


Pro 


Val 
240 


Val 


Arg 


Pro 


Leu 


Gly 
245 


Ser 


Glu 


Asn 


Leu 


Leu 
250 


Leu 


Arg 


Gly 


Ala 


Thr 
255 


Leu 


Lys 


Asn 


Thr 


Glu 


Lys 


He 


Phe 


Gly Val 


Ala 


He 


Tyr 


Thr 


Gly 










260 










265 










270 


Met 


Glu 


Thr 


Lys 


Met 
275 


Ala 


Leu 


Asn 


Tyr 


Gin 
280 


Ser 


Lys 


Ser 


Gin 


Lys 
285 


Arg 


Ser 


Ala 


Val 


Glu 
290 


Lys 


Ser 


Met 


Asn 


Ala 
295 


Phe 


Leu 


He 


Val 


Tyr 
300 


Leu 


Cys 


He 


Leu 


He 


Ser 


Lys 


Ala 


Leu 


He 


Asn 


Thr 


Val 


Leu 


Lys 








305 










310 










315 


Tyr 


Val 


Trp 


Gin 


Ser 
320 


Glu 


Pro 


Phe 


Arg 


Asp 
325 


Glu 


Pro 


Trp 


Tyr 


Asn 
330 


Gin 


Lys 


Thr 


Glu 


Ser 
335 


Glu 


Arg 


Gin 


Arg 


Asn 
340 


Leu 


Phe 


Leu 


Lys 


Ala 
345 


Phe 


Thr 


Asp 


Phe 


Leu 

350 


Ala 


Phe 


Met 


Val 


Leu 
355 


Phe 


Asn 


Tyr 


He 


He 
360 


Pro 


Val 


Ser 


Met 


Tyr 
365 


Val 


Thr 


Val 


Glu 


Met 
370 


Gin 


Lys 


Phe 


Leu 


Gly 
375 


Ser 


Tyr 


Phe 


He 


Thr 
380 


Trp 


Asp 


Glu 


Asp 


Met 
385 


Phe 


Asp 


Glu 


GlU 


Thr 
390 


Gly 


Glu 


Gly 


Pro 


Leu 
395 


Val 


Asn 


Thr 


Ser 


Asp 
400 


Leu 


Asn 


Glu 


GlU 


Leu 
405 


Gly 


Gin 


Val 


Glu 


Tyr 
410 


He 


Phe 


Thr 


Asp 


Lys 
415 


Thr 


Gly 


Thr 


Leu 


Thr 
420 


Glu 


Asn 


Asn 


Met 


Glu 
425 


Phe 


Lys 


Glu 


Cys 


Cys 
430 


He 


Glu 


Gly 


His 


Val 
435 


Tyr 


Val 


Pro 


His 


Val 


He 


Cys 


Asn 


Gly Gin 


Val 


Leu 


Pro 


Glu 


Ser 










440 










445 










450 


Ser 


Gly 


He 


Asp 


Met 
455 


He 


Asp 


Ser 


Ser 


Pro 
460 


Ser 


Val 


Asn 


Gly 


Arg 
465 


GlU 


Arg 


Glu 


GlU 


Leu 
470 


Phe 


Phe 


Arg 


Ala 


Leu 
475 


Cys 


Leu 


Cys 


His 


Thr 
480 


Val 


Gin 


Val 


Lys 


Asp 
485 


Asp 


Asp 


Ser 


Val 


Asp 
490 


Gly 


Pro 


Arg 


Lys 


Ser 
495 


Pro 


Asp 


Gly 


Gly 


Lys 
500 


Ser 


Cys 


Val 


Tyr 


He 
505 


Ser 


Ser 


Ser 


Pro 


Asp 
510 


Glu 


Val 


Ala 


Leu 


Val 
515 


Glu 


Gly 


Val 


Gin 


Arg 
520 


Leu 


Gly 


Phe 


Thr 


Tyr 
525 


Leu 


Arg 


Leu 


Lys 


Asp 
530 


Asn 


Tyr 


Met 


Glu 


He 
535 


Leu 


Asn 


Arg 


Glu 


Asn 
540 


His 


He 


Glu 


Arg 


Phe 


Glu 


Leu 


Leu 


Glu 


He 


Leu 


Ser 


Phe 


Asp 


Ser 










545 










550 








555 


Val 


Arg 


Arg 


Arg 


Met 
560 


Ser 


Val 


He 


Val 


Lys 
565 


Ser 


Ala 


Thr 


Gly 


Glu 
570 


He 


Tyr 


Leu 


Phe 


Cys 
575 


Lys 


Gly 


Ala 


Asp 


Ser 
580 


Ser 


He 


Phe 


Pro 


Arg 
585 


Val 


He 


Glu 


Gly 


Lys 
590 


Val 


Asp 


Gin 


He 


Arg 
595 


Ala 


Arg 


Val 


Glu 


Arg 
600 


Asn 


Ala 


Val 


Glu 


Gly 
605 


Leu 


Arg 


Thr 


Leu 


Cys 
610 


Val 


Ala 


Tyr 


Lys 


Arg 
615 


Leu 


He 


Gin 


Glu 


Glu 
620 


Tyr 


Glu 


Gly 


He 


Cys 
625 


Lys 


Leu 


Leu 


Gin 


Ala 
63 0 


Ala 


Lys 


Val 


Ala 


Leu 
635 


Gin 


Asp 


Arg 


GlU 


Lys 
640 


Lys 


Leu 


Ala 


Glu 


Ala 
645 


Tyr 


Glu 


Gin 


He 


Glu 
650 


Lys 


Asp 


Leu 


Thr 


Leu 
655 


Leu 


Gly 


Ala 


Thr 


Ala 
660 


Val 


Glu 


Asp 


Arg 


Leu 
665 


Gin 


Glu 


Lys 


Ala 


Ala 
670 


Asp 


Thr 


He 


Glu 


Ala 
675 


Leu 


Gin 


Lys 


Ala 


Gly 
680 


He 


Lys 


Val 


Trp 


Val 
685 


Leu 


Thr 


Gly 


Asp 


Lys 
690 


Met 


Glu 


Thr 


Ala 


Ala 


Ala 


Thr 


Cys 


Tyr Ala 


Cys 


Lys 


Leu 


Phe 


Arg 










695 










700 










705 


Arg 


Asn 


Thr 


Gin 


Leu 
710 


Leu 


Glu 


Leu 


Thr 


Thr 
715 


Lys 


Arg 


He 


Glu 


Glu 
720 


Gin 


Ser 


Leu 


His 


Asp 


Val 


Leu 


Phe 


Glu 


Leu 


Ser 


Lys 


Thr 


Val 


Leu 
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Arg 


His 


Ser 


Gly 


Ser 
740 


Leu 


Thr 


Arg 


Ala 


Asp 


Met 


Gin 


Asp 
755 


Tyr 


Gly 


Leu 


Ser 


Leu 


He 


Met 


Lys 
770 


Pro 


Arg 


Glu 


Arg 


Glu 


Leu 


Phe 


Leu 
785 


Glu 


He 


Cys 


Cys 


Cys 


Arg 


Met 


Ala 
800 


Pro 


Leu 


Gin 


He 


Lys 


Phe 


Ser 


Lys 


Glu 


His 


Pro 








815 








Gly 


Ala 


Asn 


Asp 


Val 
830 


Ser 


Met 


He 


Gly 


Val 


He 


Gly 


Lys 
845 


Glu 


Gly 


Arg 


Tyr 


Ala 


He 


Pro 


Lys 
860 


Phe 


Lys 


His 


His 


Gly 


His 


Phe 


Tyr 
875 


Tyr 


He 


Arg 


Phe 


Phe 


Tyr 


Lys 


Asn 
890 


Val 


Cys 


Phe 


Gin 


Phe 


Phe 


Cys 


Gly 
905 


Phe 


Ser 


Gin 


Tyr 


Leu 


Thr 


Leu 


Tyr 
920 


Asn 


He 


Ser 


Leu 


Tyr 


Ser 


Leu 


Met 
935 


Glu 


Gin 


His 


Arg 


Asp 


Pro 


Thr 


Leu 
950 


Tyr 


Arg 


Asp 


Arg 


Trp 


Arg 


Val 


Phe 
965 


He 


Tyr 


Trp 


Ala 


Leu 


Val 


Phe 


Phe 
980 


Phe 


Gly 


Ala 


Thr 


Val 


Thr 


Ser 


Asn 
995 


Gly 


Gin 


He 


Thr 


Leu 


Val 


Phe 


Thr 
1010 


Val 


Met 


Val 


Ala 


Leu 


Asp 


Thr 


His 
1025 


Tyr 


Trp 


Thr 


Trp 


Gly 


Ser 


Leu 


Leu 
1040 


Phe 


Tyr 


Val 


Gly 


Val 


He 


Trp 


Pro 
1055 


Phe 


Leu 


Asn 


Phe 


He 


Gin 


Met 


Leu 
1070 


Ser 


Ser 


Gly 


Leu 


Leu 


Val 


Thr 


He 
1085 


Ser 


Leu 


Leu 


Leu 


Cys 


Arg 


Gin 


Leu 
1100 


Trp 


Pro 


Thr 


Asn 


Gly 


Cys 


Ala 


Gin 
1115 


Pro 


Arg 


Asp 


Leu 


Ala 


Ser 


Leu 


Gin 
1130 


Ser 


Pro 


Gly 


Ala 


Ala 


Trp 


Tyr 


Ser 
1145 


Ser 


His 


Ser 


Trp 


Lys 


Glu 


Lys 


Val 
1160 


Ser 


Thr 


Glu 


Ser 


His 


His 


His 


Cys 
1175 


Ser 


Ser 


He 


Ser 


Arg 


Val 


Gly Met 


Leu 


Val 
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730 








735 


Asp 


Asn 


Leu 


Ser 


Gly 


Leu Ser 




745 








750 


He 


He 


Asp 


Gly 


Ala 


Ala Leu 




760 








765 


Asp 


Gly 


Ser 


Ser 


Gly 


Asn Tyr 




775 








780 


Arg 


Ser 


Cys 


Ser 


Ala 


Val Leu 




790 








795 


Lys 


Ala 


Gin 


He 


Val 


Lys Leu 




805 








810 


He 


Thr 


Leu 


Ala 


He 


Gly Asp 




820 








825 


Leu 


Glu 


Ala 


His 


Val 


Gly He 




835 








840 


Gin 


Ala 


Ala Arg 


Asn 


Ser Asp 




850 








855 


Leu 


Lys 


Lys 


Met 


Leu 


Leu Val 




865' 








870 


He 


Ser 


Glu 


Leu 


Val 


Gin Tyr 




880 








885 


He 


Phe 


Pro 


Gin 


Phe 


Leu Tyr 




895 








900 


Gin 


Thr 


Leu 


Tyr 


Asp 


Thr Ala 




910 








915 


Phe 


Thr 


Ser 


Leu 


Pro 


He Leu 




925 








930 


Val 


Gly 


He Asp 


Val 


Leu Lys 




940 








945 


Val 


Ala 


Lys 


Asn 


Ala 


Leu Leu 




955 








960 


Thr 


Leu 


Leu Gly 


Leu 


Phe Asp 




970 








975 


Tyr 


Phe 


Val 


Phe 


Glu 


Asn Thr 




985 








990 


Phe 


Gly 


Asn 


Trp 


Thr 


Phe Gly 




1000 








1005 


Phe 


Thr 


Val 


Thr 


Leu 


Lys Leu 




1015 








1020 


Trp 


He 


Asn 


His 


Phe 


Val He 




1030 








1035 


Val 


Phe 


Ser 


Leu 


Leu 


Trp Gly 




1045 








1050 


Tyr 


Gin 


Arg 


Met 


Tyr 


Tyr Val 




1060 








1065 


Pro 


Ala 


Trp 


Leu 


Ala 


He Val 




1075 








1080 


Pro 


Asp 


Val 


Leu 


Lys 


Lys Val 




1090 








1095 


Ala 


Thr 


Glu 


Arg 


Val 


Gin Gin 




1105 








1110 


Arg 


Asp 


Ser 


GlU 


Phe 


Thr Pro 




1120 








1125 


Tyr 


Gin 


Ser 


Thr 


cys 


Pro Ser 




1135 








1140 


Gin 


Gin 


Val 


Thr 


Leu 


Ala Ala 




1150 








1155 


Pro 


Pro 


Pro 


He 


Leu 


Gly Gly 




1165 








1170 


Pro 


Ser 


His 


Ser 


Cys 


Pro Arg 



1180 1185 
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Met 


Gly 


Lys 


He 


Glu 


Asn 


Asn 


Glu 


Arg 


Val 


He 


Leu Asn 


Val 


Gly 


1 








5 










10 








15 


Gly 


Thr 


Arg 


His 


Glu 


Thr 


Tyr 


Arg 


Ser 


Thr 


Leu 


Lys Thr 


Leu 


Pro 










20 










25 








30 


Gly 


Thr 


Arg 


Leu 


Ala 


Leu 


Leu 


Ala 


Ser 


Ser 


Glu 


Pro Pro 


Gly Asp 










35 










40 








45 


Cys 


Leu 


Thr 


Thr 


Ala 


Gly 


Asp 


Lys 


Leu 


Gin 


Pro 


Ser Pro 


Pro 


Pro 










50 










55 








60 


Leu 


Ser 


Pro 


Pro 


Pro 


Arg 


Ala 


Pro 


Pro 


Leu 


Ser 


Pro Gly Pro 


Gly 










65 










70 








75 


Gly 


Cys 


Phe 


Glu 


Gly 


Gly 


Ala 


Gly Asn 


Cys 


Ser 


Ser Arg Gly Gly 










80 










85 








90 


Arg 


Ala 


Ser 


Asp 


His 


Pro 


Gly Gly Gly Arg 


Glu 


Phe Phe 


Phe 


Asp 










95 










100 








105 


Arg 


His 


Pro 


Gly 


Val 


Phe 


Ala 


Tyr 


Val 


Leu 


Asn 


Tyr Tyr 


Arg 


Thr 










110 










115 








120 


Gly 


Lys 


Leu 


His 


Cys 


Pro 


Ala 


Asp 


Val 


Cys 


Gly 


Pro Leu 


Phe 


Glu 










125 










130 








135 


Glu 


Glu 


Leu 


Ala 


Phe 


Trp 


Gly 


He 


Asp 


Glu 


Thr 


Asp Val 


Glu 


Pro 










140 










145 








150 


Cys 


Cys 


Trp 


Met 


Thr 


Tyr 


Arg 


Gin 


His 


Arg 


Asp 


Ala Glu 


Glu 


Ala 










155 










160 








165 


Leu 


Asp 


lie 


Phe 


Glu 


Thr 


Pro 


Asp 


Leu 


He 


Gly 


Gly Asp 


Pro 


Gly 










170 










175 








180 


Asp 


Asp 


Glu 


Asp 


Leu 


Ala 


Ala 


Lys 


Arg 


Leu 


Gly 


He Glu 


Asp 


Ala 










185 










190 








195 


Ala 


Gly 


Leu 


Gly 


Gly 


Pro 


Asp 


Gly 


Lys 


Ser 


Gly 


Arg Trp 


Arg 


Arg 










200 










205 








210 


Leu 


Gin 


Pro 


Arg 


Met 


Trp 


Ala 


Leu 


Phe 


Glu 


Asp 


Pro Tyr 


Ser 


Ser 










215 










220 








225 


Arg 


Ala 


Ala 


Arg 


Phe 


He 


Ala 


Phe 


Ala 


Ser 


Leu 


Phe Phe 


He 


Leu 










230 










235 








240 


Val 


Ser 


lie 


Thr 


Thr 


Phe 


Cys 


Leu 


Glu 


Thr 


His 


Glu Ala 


Phe 


Asn 










245 








250 








255 


lie 


Val 


Lys 


Asn 


ly s 


Thr 


Glu 


Pro 


Val 


He 


Asn 


Gly Thr 


Ser 


Val 










260 










265 








270 


Val 


Leu 


Gin 


Tyr 


Glu 


He 


GlU 


Thr 


Asp 


Pro 


Ala 


Leu Thr 


Tyr 


Val 










275 










280 








285 


Glu 


Gly 


Val 


Cys 


Val 


Val 


Trp 


Phe 


Thr 


Phe 


Glu 


Phe Leu 


Val 


Arg 










290 










295 








300 


lie 


Val 


Phe 


Ser 


Pro 


Asn 


Lys 


Leu 


Glu 


Phe 


He 


Lys Asn 


Leu 


Leu 










305 










310 








315 


Asn 


lie 


lie 


Asp 


Phe 


Val 


Ala 


He 


Leu 


Pro 


Phe 


Tyr Leu 


Glu 


Val 










320 










325 








330 


Gly 


Leu 


Ser 


Gly 


Leu 


Ser 


Ser 


Lys 


Ala 


Ala 


Lys 


Asp Val 


Leu Gly 










335 










340 








345 


Phe 


Leu 


Arg 


Val 


Val 


Arg 


Phe 


Val 


Arg 


He 


Leu 


Arg lie 


Phe 


Lys 










350 










355 








360 


Leu 


Thr 


Arg 


His 


Phe 


Val 


Gly Leu Arg Val 


Leu 


Gly His 


Thr 


Leu 










365 










370 








375 


Arg 


Ala 


Ser 


Thr 


Asn 


Glu 


Phe 


Leu 


Leu 


Leu 


He 


He Phe 


Leu 


Ala 










380 










385 








390 


Leu 


Gly Val 


Leu 


He 


Phe 


Ala 


Thr 


Met 


He 


Tyr 


Tyr Ala 


Glu 


Arg 










395 










400 








405 


Val 


Gly Ala 


Gin 


Pro 


Asn 


Asp 


Pro 


Ser 


Ala 


Ser 


Glu His 


Thr 


Gin 










410 










415 








420 


Phe 


Lys 


Asn 


He 


Pro 


He 


Gly 


Phe 


Trp 


Trp 


Ala 


Val Val 


Thr 


Met 










425 










430 








435 


Thr 


Thr 


Leu 


Gly 


Tyr 


Gly 


Asp 


Met 


Tyr 


Pro 


Gin 


Thr Trp 


Ser 


Gly 










440 










445 








450 
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Met 


Leu 


Val 


Gly 


Ala 


Leu 


Cys 


Ala 








455 








Ala 


Met 


Pro 


Val 
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470 


Val 


He 


Val 


Ser 


Leu 


Ala 


Met 


Ala 
485 


Lys 


Gin 


Lys 


His 


lie 


Pro 


Pro 


Ala 
500 


Pro 


Gin 


Ala 


Thr 


Glu 


Leu 


Asn 


Met 
515 


Ala 


Cys 


Asn 


Leu 


Gly 


Lys 


Asp 


Asn 
530 


Arg 


Leu 


Leu 


Ser 


Gly 


Asp 


Asp 


Ser 
545 


Thr 


Gly 


Ser 


Glu 


Arg 


Leu 


Pro 


He 
560 


Arg 


Arg 


Ser 


Arg 


Gly 


Glu 


Thr 


Cys 
575 


Phe 


Leu 


Leu 


Ala 


Ser 


Asp 


Gly 


Gly 
590 


He 


Arg 


Lys 


Leu 


Asn 


Asn 


He 


Ala 
605 


Gly 


Leu 


Ala 


Pro 


Val 


Thr 


Ser 


Pro 
620 


Tyr 


Asn 


Ser 


Arg 


Ser 


Pro 


He 


Pro 


Ser 


He 


Leu 
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Met 


Ser 


Lys 


Asp 


Leu 


Ala 


Ala 


Met 


1 








5 








Gly 


Val 


Arg 


Thr 


Glu 
20 


Thr 


Ala 


Pro 


Val 


Gly 


Leu 


His 


Ala 
35 


Tyr 


Asp 


He 


Val 


Phe 


Val 


He 


Ala 
50 


Val 


Gly 


He 


Arg 


Gly 


Thr 


He 


Gly 
65 


Gly 


Tyr 


Phe 


Trp 


Trp 


Pro 


He 


Gly 
80 


Ala 


Ser 


Leu 


Gly 


Leu 


Phe 


He 


Gly 
95 


Leu 


Ala 


Gly 


Ala 


Val 


Gly Gly 


Phe 


Glu 


Trp 


Asn 










110 








Leu 


Gly Trp 


Val 


Phe 


Val 


Pro 


Val 










125 








Thr 


Met 


Pro 


Gin 


Tyr 
140 


Leu 


Lys 


Lys 


Gin 


Val 


Tyr 


Met 


Ser 


Val 


Leu 


Ser 








155 








Lys 


He 


Ser 


Thr 


Asp 


He 


Phe 


Ser 








170 








Ala 


Leu Gly 


Trp 


Asn 


Leu 


Tyr 


Leu 










185 








Val 


Thr 


Ala. 


Val 


Tyr 
200 


Thr 


He 


Ala 


Tyr 


Thr Asp Ala 


Leu 


Gin 


Thr 


Val 



215 



Leu 


Ala 


Gly Val 


Leu 


Thr 


He 




460 








465 


Asn 


Asn 


Phe Gly Met 


Tyr 


Tyr 




475 








480 


Leu 


Pro 


Arg Lys 


Arg 


Lys 


Lys 




490 








495 


Ser 


Ser 


Pro Thr 


Phe 


Cys 


Lys 




505 








510 


Ser 


Thr 


Gin Ser 


Asp 


Thr 


Cys 




520 








525 


Glu 


His 


Asn Arg 


Ser 


Val 


Leu 




535 








540 


Glu 


Pro 


Pro Leu 


Ser 


Pro 


Pro 




550 








555 


Ser 


Thr 


Arg Asp 


Lys 


Asn 


Arg 




565 








570 


Thr 


Thr 


Gly Asp 


Tyr 


Thr 


Cys 




580 








585 


Gly 


Tyr 


Glu Lys 


Ser 


Arg 


Ser 




595 








600 


Gly 


Asn 


Ala Leu 


Arg 


Leu 


Ser 




610 








615 


Pro 


Cys 


Pro Leu 


Arg 


Arg 


Ser 




625 








630 



Gly 


Pro 


Gly Ala 


Ser Gly 


Asp 




10 






15 


His 


He 


Ala Leu 


Asp Ser 


Arg 




25 






30 


Ser 


Val 


Val Val 


He Tyr 


Phe 




40 






45 


Trp 


Ser 


Ser He 


Arg Ala 


Ser 




55 






60 


Leu 


Ala 


Gly Arg 


Ser Met 


Ser 




70 






75 


Met 


Ser 


Ser Asn 


Val Gly 


Ser 




85 






90 


Thr 


Gly Ala Ala 


Gly Gly 


Leu 




100 






105 


Ala 


Thr 


Trp Leu 


Leu Leu 


Ala 




115 




120 


Tyr 


He 


Ala Ala Gly Val 


Val 




130 






135 


Arg 


Phe 


Gly Gly Gin Arg 


He 




145 






150 


Leu 


He 


Leu Tyr 


He Phe 


Thr 




160 




165 


Gly Ala 


Leu Phe 


He Gin 


Met 




175 






180 


Ser 


Thr 


Gly He 


Leu Leu 


Val 




190 






195 


Gly Gly 


Leu Met 


Ala Val 


He 




205 






210 


He 


Met 


Val Gly Gly Ala 


Leu 




220 






225 
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Val 


Leu 


Met 


Phe 


Leu 


Gly 


Phe 


Gin 










230 






Leu 


Glu 


Gin 


Arg 


Tyr 


Arg Gin Ala 










245 








Asn 


Thr 


Thr 


Cys 


His 


Leu 


Pro 


Arg 










260 






Arg 


Asp 


Pro 


Val 


Ser 


Gly Asp 


He 










275 








Gly 


Leu 


Thr 


Val 


Leu 


Ala 


Thr 


Trp 










290 






He 


Val 


Gin 


Arg 


Ser 


Leu 


Ser 


Ala 










305 








Gly 


Gly 


Ser 


Val 


Leu 


Gly 


Gly 


Tyr 










320 








Phe 


lie 


Val 


Met 


Pro 


Gly Met 


He 










335 








Glu 


Val 


Gly 


Cys 


Val 


Asp 


Pro 


Asp 










350 








Ala 


Arg 


Val 


Gly 


Cys 


Ser 


Asn 


He 










365 








Ala 


Leu 


Met 


Pro 


Val 


Gly 


Leu 


Arg 










380 








Met 


Ala 


Ala 


Leu 


Met 


Ser 


Ser 


Leu 










395 








Ser 


Thr 


Leu 


Phe 


Thr 


He 


Asp 


Val 










410 








Ser 


Thr 


Glu 


Gin 


Glu 


Leu 


Met 


Val 










425 








Phe 


Leu 


Val 


Val 


He 


Ser 


He 


Leu 










440 








Ser 


Asn 


Ser 


Gly 


Gin 


Leu 


Phe Asp 










455 








Tyr 


Leu 


Ala 


Pro 


Pro 


He 


Thr 


Ala 










470 








Cys 


Lys 


Arg 


Val 


Thr 


Glu 


Pro 


Gly 










485 








Gly 


Leu 


Gly 


Val 


Gly 


Leu 


Leu Arg 










500 








Pro 


Ala 


Pro 


Ala 


Cys 


Gly Glu Val 










515 








Lys 


Asp 


Phe 


His 


Tyr 


Leu 


Tyr 


Phe 










530 








Thr 


Ala 


He 


Val 


He 


Val 


He 


Val 










545 








Pro 


Glu 


Glu 


Gin 


Leu 


Thr 


Arg 


Leu 










560 








Pro 


Leu 


Ser 


Glu 


Leu 


Glu 


Lys 


Glu 










575 






He 


Ser 


Glu 


Arg 


Pro 


Ala 


Gly Glu 










590 








Ala 


Glu 


Asn 


Ser 


Ser 


Leu 


Gly 


Gin 










605 






Arg 


Ser 


Trp 


Gly 


Lys 


Leu 


Leu 


Trp 










620 








Gly 


Thr 


Pro 


Glu 


Gin 


Ala 


Leu 


Ser 










635 








Glu 


Gin 


Lys 


Leu 


Thr 


Ser 


He 


Glu 










650 








Val 


Cys 


Asn 


He 


Asn 


Ala 


Val 


Leu 










665 








Leu 


Trp 


Gly 


Tyr 


Phe 


Ala 














680 
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Asp 


Val 


Gly 


Trp 


Tyr 


Pro 


Gly 




235 










240 


He 


Pro 


Asn 


Val 


Thr 


Val 


Pro 




250 










255 


Pro 


Asp 


Ala 


Phe 


His 


He 


Leu 




265 










270 


Pro 


Trp 


Pro 


Gly 


Leu 


He 


Phe 




280 










285 


Cys 


Trp 


Cys 


Thr 


Asp 


Gin 


Val 




295 










300 


Lys 


Ser 


Leu 


Ser 


His 


Ala 


Lys 




310 










315 


Leu 


Lys 


He 


Leu 


Pro 


Met 


Phe 




325 










330 


Ser 


Arg 


Ala 


Leu 


Phe 


Pro 


Asp 




340 










345 


Val 


Cys 


Gin 


Arg 


He 


Cys 


Gly 




355 










360 


Ala 


Tyr 


Pro 


Lys 


Leu 


Val 


Met 




370 










375 


Gly 


Leu 


Met 


lie 


Ala 


Val 


He 




385 










390 


Thr 


Ser 


He 


Phe 


Asn 


Ser 


Ser 




400 










405 


Trp 


Gin 


Arg 


Phe 


Arg 


Arg 


Lys 




415 










420 


Val 


Gly 


Arg 


Val 


Phe 


Val 


Val 




43 0 










435 


Trp 


He 


Pro 


He 


He 


Gin 


Ser 




445 










450 


Tyr 


He 


Gin 


Ala 


Val 


Thr 


Ser 




460 










465 


Leu 


Phe 


Leu 


Leu 


Ala 


He 


Phe 




475 










480 


Ala 


Phe 


Trp 


Gly 


Leu 


Val 


Phe 




490 










495 


Met 


He 


Leu 


Glu 


Phe 


Ser 


Tyr 




505 










510 


Asp 


Arg 


Arg 


Pro 


Ala 


Val 


Leu 




520 










525 


Ala 


He 


Leu 


Leu 


Cys 


Gly 


Leu 




535 










540 


Ser 


Leu 


Cys 


Thr 


Thr 


Pro 


He 




550 










555 


Thr 


Trp 


Trp 


Thr 


Arg 


Asn 


cys 




565 










570 


Ala 


His 


Glu 


Ser 


Thr 


Pro 


Glu 




580 










585 


Cys 


Pro 


Ala 


Gly 


Gly 


Gly 


Ala 




595 










600 


Glu 


Gin 


Pro 


Glu 


Ala 


Pro 


Ser 




610 










615 


Ser 


Trp 


Phe 


Cys 


Gly 


Leu 


Ser 




625 










630 


Pro 


Ala 


Glu 


Lys 


Ala 


Ala 


Leu 




640 










645 


Glu 


Glu 


Pro 


Leu 


Trp 


Arg 


His 




655 










660 


Leu 


Leu 


Ala 


He 


Asn 


He 


Phe 



670 675 
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Met 


Trp 


Arg 


Trp 


He 


Arg 


Gin 


Gin 
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5 








Gin 


Ser 


Asp 


Thr 


Arg 


Thr 


He 


Tyr 










20 








Asn 


Gly Leu 


Tyr 


Thr 


Pro 


Gin 


Lys 










35 








Ser 


Ser 


Lys 


Tyr 


Thr 


Val 


Trp 


Asn 










50 








Glu 


Gin 


Phe 


Arg 


Arg 


Val 


Ala 


Asn 










65 








Leu 


Val 


Gin 


Leu 


Met 


He 


Asp 


Thr 










80 








Gly 


Leu 


Pro 


Leu 


Phe 


Phe 


Val 


He 










95 








Gly 


Tyr 


Glu Asp 


Trp 


Leu 


Arg 


His 










110 








Gly 


Ala 


Pro 


Val 


Tyr 


Val 


Val 


Arg 










125 








Arg 


Ser 


Lys 


Asn 


He 


Arg 


Val 


Gly 










140 








Asp 


Glu 


He 


Phe 


Pro 


Ala 


Asp 


Leu 










155 








Leu 


Asp 


Gly 


Ser 


Cys 


His 


Val 


Thr 










170 








Thr 


Asn 


Leu 


Lys 


Thr 


His 


Val 


Ala 










185 








Gin 


Thr 


Val 


Ala 


Asn 


Leu 


Asp 


Thr 










200 








Gin 


Gin 


Pro 


Glu 


Ala 


Asp 


Leu 


Tyr 










215 








He 


Thr 


Gin 


Gin 


Met 


Glu 


Glu 


He 










230 








Ser 


Leu 


Leu 


Leu 


Arg 
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Ala 


Arg 










245 








Phe 


Gly 


Val 


Ala 


Val 


Tyr 


Thr 


Gly 










260 








Asn 


Tyr 


Lys 


Ser 


Lys 


Ser 


Gin 


Lys 










275 








Met 


Asn 


Thr 


Phe 


Leu 


He 


He 


Tyr 










290 








Ala 


Val 


He 


Ser 


Thr 


He 


Leu 


Lys 










305 








Lys 


Trp 


Asp 


Glu 


Pro 


Trp 


Tyr Asn 










320 








Asn 


Ser 


Ser 


Lys 


Val 


Glu 


Tyr 


Val 










335 








Leu 


Thr 


Glu 


Asn 


Glu 


Met 


Gin 


Phe 










350 








Met 


Lys Tyr Gin 


Glu 
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Asn Gly 
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Pro 
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Ser 
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Glu 


Gly Asn 










380 








Ser 


His 


Leu 


Asn 


Asn 


Leu 


Ser 


His 










395 








Arg 


Thr 


Ser 


Pro 


Glu 


Asn 


Glu 


Thr 








410 








Leu 


Phe 


Phe 


Lys 


Ala 


Val 


Ser 


Leu 










425 








Asn 


Val 


Gin 


Thr 


Asp 


Cys 


Thr 


Gly 










440 








Leu 


Ala 


Pro 


Ser 


Gin 


Leu 


Glu 


Tyr 



455 



Leu 


Gly 


Phe 


Asp 


Pro 


Pro 


His 




10 










15 


Val 


Ala 


Asn 


Arg 


Phe 


Pro 


Gin 




25 










30 


Phe 


He 


Asp 


Asn Arg 


He 


He 




40 










45 


Phe 


Val 


Pro 


Lys 


Asn 


Leu 


Phe 




55 








60 


Phe 


Tyr 


Phe 


Leu 


He 


He 


Phe 




70 










75 


Pro 


Thr 


Ser 


Pro 


Val 


Thr 


Ser 




85 










90 


Thr 


Val 


Thr 


Ala 


He 


Lys 


Gin 




100 










105 


Asn 


Ser 


Asp 


Asn 


Glu 


Val 


Asn 




115 










120 


Ser 


Gly 


Gly 


Leu 


Val 


Lys 


Thr 
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135 


Asp 


He 


Val 


Arg 


He 


Ala 


Lys 




145 










150 


Val 


Leu 


Leu 


Ser 


Ser 


Asp 


Arg 




160 










165 


Thr 


Ala 


Ser 


Leu 


Asp 


Gly 


Glu 




175 










180 


Val 


Pro 


Glu 


Thr 


Ala 


Leu 


Leu 




190 










195 


Leu 


Val 


Ala 


Val 


He 


Glu 


Cys 




205 










210 


Arg 


Phe 


Met 


Gly Arg 


Met 


He 




220 










225 


Val 


Arg 


Pro 


Leu 


Gly 


Pro 


Glu 




235 










240 


Leu 


Lys 


Asn 


Thr 


Lys 


Glu 


He 




250 










255 


Met 


Glu 


Thr 


Lys 


Met 


Ala 


Leu 




265 










270 


Arg 


Ser 


Ala 


Val 


Glu 


Lys 


Ser 




280 










285 


Leu 


Val 


He 


Leu 


He 


Ser 


Glu 




295 










300 


Tyr 


Thr 


Trp 


Gin 


Ala 


Glu 


Glu 




310 










315 


Gin 


Lys 


Thr 


Glu 
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Gin 


Arg 
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330 
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Thr 
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Thr 
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340 
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Arg 
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He 


Asn 
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Pro 
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Ser 


Tyr 


Leu 


Ser 


Ser 


Leu 
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Leu 


Thr 


Thr 


Ser 
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Ser 
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400 










405 


Glu 
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He 


Lys 
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Asp 




415 










420 
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Gin 


He 
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430 
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Asp 
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Pro 
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Ser 
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460 










465 
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Lys 


Ala 


Leu 


Val 


Glu 
470 


Ala 


Ala 


Ala 


Arg 


He 
475 


Gly 


He 


Val 


Phe 


He 
480 


Gly 


Asn 


Ser 


Glu 


Glu 
485 


Thr 


Met 


Glu 


Val 


Lys 
490 


Thr 


Leu 


Gly 


Lys 


Leu 
495 


Glu 


Arg 


Tyr 


Lys 


Leu 
500 


Leu 


His 


He 


Leu 


Glu 
505 


Phe 


Asp 


Ser 


Asp 


Arg 
510 


Arg 


Arg 


Met 


Ser 


Val 
515 


He 


Val 


Gin 


Ala 


Pro 
520 


Ser 


Gly 


Glu 


Lys 


Leu 
525 


Leu 


Phe 


Ala 


Lys 


Gly 
530 


Ala 


Glu 


Ser 


Ser 


He 
535 


Leu 


Pro 


Lys 


Cys 


He 
540 


Gly 


Gly 


Glu 


He 


Glu 
545 


Lys 


Thr 


Arg 


He 


His 
550 


Val 


Asp 


Glu 


Phe 


Ala 
555 


Leu 


Lys 


Gly 


Leu 


Arg 
560 


Thr 


Leu 


Cys 


He 


Ala 
565 


Tyr 


Arg 


Lys 


Phe 


Thr 
570 


Ser 


Lys 


Glu 


Tyr 


Glu 
575 


Glu 


He 


Asp 


Lys 


Arg 
580 


He 


Phe 


Glu 


Ala 


Arg 
585 


Thr 


Ala 


Leu 


Gin 


Gin 
590 


Arg 


Glu 


Glu 


Lys 


Leu 
595 


Ala 


Ala 


Val 


Phe 


Gin 
600 


Phe 


He 


Glu 


Lys 


Asp 
605 


Leu 


He 


Leu 


Leu 


Gly 
610 


Ala 


Thr 


Ala 


Val 


Glu 
615 


Asp 


Arg 


Leu 


Gin 


Asp 
620 


Lys 


Val 


Arg 


Glu 


Thr 

625 


He 


Glu 


Ala 


Leu 


Arg 
630 


Met 


Ala 


Gly 


He 


Lys 
63 5 


Val 


Trp 


Val 


Leu 


Thr 
640 


Gly 


Asp 


Lys 


His 


Glu 
645 


Thr 


Ala 


Val 


Ser 


Val 
650 


Ser 


Leu 


Ser 


Cys 


Gly 
655 


His 


Phe 


His 


Arg 


Thr 
660 


Met 


Asn 


He 


Leu 


Glu 
665 


Leu 


He 


Asn 


Gin 


Lys 
670 


Ser 


Asp 


Ser 


Glu 


Cys 
675 


Ala 


Glu 


Gin 


Leu 


Arg 
680 


Gin 


Leu 


Ala 


Arg 


Arg 
685 


He 


Thr 


Glu 


Asp 
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He 


Gin 
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Leu 


Val 


Val 
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Thr 


Ser 


Leu 


Ser 


Leu 
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700 
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Ala 


Leu 


Arg 


Glu 
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Glu 


Lys 


Leu 


Phe 


Met 
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Val 


Cys 


Arg 


Asn 










710 
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720 


Cys 
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Ala 


Val 


Leu 
725 


Cys 


Cys 


Arg 


Met 


Ala 
730 


Pro 


Leu 


Gin 


Lys 


Ala 
735 


Lys 


Val 


He 


Arg 


Leu 
740 


He 


Lys 


He 
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Pro 
745 


Glu 


Lys 


Pro 


He 


Thr 
750 


Leu 


Ala 


Val 
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Asp 
755 
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Ala 


Asn 
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Val 
760 
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Met 


He 


Gin 


Glu 
765 
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He 
770 
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He 


Met 
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Lys 
775 
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Arg 


Gin 


Ala 
780 


Ala 


Arg 


Asn 


Ser 


Asp 
785 


Tyr 


Ala 


He 


Ala 


Arg 
790 


Phe 


Lys 


Phe 


Leu 


Ser 
795 


Lys 


Leu 


Leu 


Phe 


Val 
800 


His 


Gly 


His 


Phe 


Tyr 
805 


Tyr 


He 


Arg 


He 


Ala 
810 


Thr 


Leu 


Val 


Gin 


Tyr 
815 


Phe 


Phe 


Tyr 


Lys 


Asn 
820 


Val 


Cys 


Phe 


He 


Thr 
825 


Pro 


Gin 


Phe 


Leu 


Tyr 
83 0 


Gin 


Phe 


Tyr 


Cys 


Leu 
835 


Phe 


Ser 


Gin 


Gin 


Thr 
840 


Leu 


Tyr 


Asp 


Ser 


Val 
845 


Tyr 


Leu 


Thr 


Leu 


Tyr 
850 


Asn 


He 


Cys 


Phe 


Thr 
855 


Ser 


Leu 


Pro 


He 


Leu 


He 


Tyr 


Ser 


Leu 


Leu 


Glu 


Gin 


His 


Val 


Asp 










860 








865 










870 


Pro 


His 


Val 


Leu 


Gin 
875 


Asn 


Lys 


Pro 


Thr 


Leu 
880 


Tyr 


Arg 


Asp 


He 


Ser 
885 


Lys 


Asn 


Arg 


Leu 


Leu 
890 


Ser 


He 


Lys 


Thr 


Phe 
895 


Leu 


Tyr 


Trp 


Thr 


He 
900 


Leu 


Gly 


Phe 


Ser 


His 
905 


Ala 


Phe 


He 


Phe 


Phe 
910 


Phe 


Gly 


Ser 


Tyr 


Leu 
915 


Leu 


He 


Gly 


Lys 


Asp 
920 


Thr 


Ser 


Leu 


Leu 


Gly 
925 


Asn 


Gly 


Gin 


Met 


Phe 
930 


Gly 


Asn 


Trp 


Thr 


Phe 
935 


Gly 


Thr 


Leu 


Val 


Phe 
940 


Thr 


Val 


Met 


Val 


lie 
945 


Thr 


Val 


Thr 


Val 


Lys 
950 


Met 


Ala 


Leu 


Glu 


Thr 
955 


His 


Phe 


Trp 


Thr 


Trp 
960 


He 


Asn 


His 


Leu 


Val 


Thr 


Trp 


Gly 


Ser 


He 


He 


Phe 


Tyr 


Phe 


Val 
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965 






Phe 


Ser 


Leu 


Phe Tyr 


Gly Gly 


He 








980 






Gin 


As n 


Met 


Tyr Phe 


Val Phe 


He 








995 






Ala 


Trp 


Phe 


Ala He 


He Leu 


Met 








1010 






Asp 


lie 


He 


Lys Lys 


Val Phe 


Asp 








1025 






Thr 


Glu 


Lys 


Ala Gin 


Leu Thr 


Glu 








1040 






Leu 


Asp 


Ser 


Met Cys 


Cys Phe 


Pro 








1055 






Ser 


Val 


Gly 


Arg Met 


Leu Glu Arg 








1070 






Thr 


His 


He 


Ser Arg 


Cys Glu 


He 








1085 






Arg 
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<400> 21 



Met 


Ala 


Leu 


Gin 


Met 


Phe 


Val 


Thr 


-L 








z> 








Leu 


Leu 


Leu 


Val 


Ala 


Leu 


Glu 


Cys 










20 






Asn 


Glu 


Ser 


Ala 


Asn 


Ser 


Thr 


Ala 










35 








Phe 


Ala 


Ala 


Ala 


Ser 


Ser 


Glu 


Pro 










50 








Glu 


Leu 


Asp 


Tyr 


Asp 


Tyr 


Val 


Gin 










65 








Trp 


He 


Leu 


Leu 


Ala 


Ser 


Leu 


Ala 










80 








His 


Arg 


Leu 


Pro 


Gly 


Leu 


Met 


Pro 










95 








Val 


Gly 


Ala 


Leu 


Val 


Gly 


Gly 


He 










110 








Ser 


Pro 


Pro 


Val 


Met 


Asp 


Ser 


Ser 










125 






Pro 


Pro 


He 


Val 


Leu 


Glu 


Gly 


Gly 










140 








Phe 


Phe 


Glu 


Asn 


He 


Gly 


Ser 


He 










155 






Ala 


Leu 


He 


Asn 


Ala 


Leu 


Gly 


He 










170 






Cys 


Gin 


Val 


Lys 


Ala 


Phe 


Gly 


Leu 










185 








Asn 


Leu 


Leu 


Phe 


Gly 


Ser 


Leu 


He 










200 








Val 


Leu 


Ala 


Val 


Phe 


Glu 


Glu 


Ala 










215 








Met 


Met 


He 


Phe 


Gly 


Glu 


Ala 


Leu 










230 








Val 


Leu 


Tyr 


Asn 


Met 


Leu 


He 


Ala 










245 








Glu 


Asp 


He 


Glu 


Thr 


Val 


Asp 


He 










260 








He 


Val 


Val 


Gly 


Leu 


Gly 


Gly 


Val 



970 975 



Leu 


Trp 


Pro 


Phe 


Leu 


Gly 


Ser 




985 










990 


Gin 


Leu 


Leu 


Ser 


Ser 


Gly 


Ser 


1000 








1005 


Val 


Val 


Thr 


Cys 


Leu 


Phe 


Leu 


1015 








1020 


Arg 


His 


Leu 


His 


Pro 


Thr 


Ser 


1030 








1035 


Thr 


Asn 


Ala 


Gly 


He 


Lys 


Cys 


1045 








1050 


Glu 


Gly 


Glu 


Ala 


Ala 


Cys 


Ala 


1060 








1065 


Val 


He 


Gly Arg 


Cys 


Ser 


Pro 


1075 








1080 


Ser 


Leu 


Ser 


Ser 


Leu 


Cys 


Cys 


1090 








1095 


Tvr 


Ser 


Pro 


Trp 


Asn 


Cvs 


Leu 




10 










15 


Ser 


Glu 


Ala 


Ser 


Ser 


Asp 


Leu 




25 










30 


Gin 


Tvr 


Ala 


Ser 


Asn 


Ala 


Trp 




40 










45 


Glu 


Glu 


Gly 


He 


Ser 


Val 


Phe 




55 








60 


He 


Pro 


Tyr 


Glu 


Val 


Thr 


Leu 




70 










75 


Lys 


He 


Gly 


Phe 


His 


Leu 


Tyr 




85 










90 


Glu 


Ser 


Cys 


Leu 


Leu 


He 


Leu 




100 










105 


He 


Phe 


Gly Thr 


Asp 


His 


Lys 




115 










120 


He 


Tyr 


Phe 


Leu 


Tyr 


Leu 


Leu 




130 










135 


Tyr 


Phe 


Met 


Pro 


Thr 


Arg 


Pro 




145 










150 


Leu 


Trp 


Trp Ala 


Val 


Leu 


Gly 




160 










165 


Gly 


Leu 


Ser 


Leu 


Tyr 


Leu 


He 




175 










180 


Gly 


Asp 


Val 


Asn 


Leu 


Leu 


Gin 




190 










195 


Ser 


Ala 


Val 


Asp 


Pro 


Val 


Ala 




205 










210 


Arg 


Val 


Asn 


Glu 


Gin 


Leu 


Tyr 




220 










225 


Leu 


Asn 


Asp 


Gly 


He 


Thr 


Val 




235 










240 


Phe 


Thr 


Lys 


Met 


His 


Lys 


Phe 




250 










255 


Leu 


Ala 


Gly Cys 


Ala 


Arg 


Phe 




265 










270 


Leu 


Phe 


Gly 


He 


Val 


Phe 


Gly 
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275 








Phe 


He 


Ser 


Ala 


Phe 


He 


Thr 


Arg 










290 








He 


Glu 


Pro 


Leu 


He 


Val 


Phe 


Met 










305 








Ala 


Ala 


Glu 


Thr 


Leu 


Tyr 


Leu 


Ser 










320 






Cys 


Ala 


Val 


Thr 


Met 


Lys 


Lys 


Tyr 










335 








Thr 


Ser 


Tyr 


Thr 


Thr 


He 


Lys 


Tyr 










350 








Val 


Ser 


Glu 


Thr 


Leu 


He 


Phe 


He 










365 








Gly 


Lys 


Asn 


His 


Glu 


Trp Asn Trp 










380 








Ala 


Phe 


Cys 


Gin 


He 


Trp Arg Ala 










395 








Tyr 


He 


Ser 


Asn 


Gin 


Phe Arg Thr 










410 








Gin 


Cys 


He 


He 


Phe 


Tyr 


Ser Gly 










425 








Ser 


Leu 


Ala 


Phe 


Leu 


Leu 


Pro 


Leu 










440 








Met 


Phe 


Val 


Thr 


Ala 


Thr 


Leu 


Val 










455 








He 


Gin 


Gly 


He 


Thr 


Val 


Gly 


Pro 










470 








Lys 


Lys 


Thr 


Asn 


Lys 


Lys 


Glu 


Ser 










485 








Arg 


Leu 


Met 


Asp 


His 


Leu 


Lys 


Ala 










500 








His 


Trp 


Ser 


His 


Tyr 


Gin 


Val 


Arg 










515 








His 


Arg 


Tyr 


Leu 


Arg 


Lys 


He 


Leu 










530 








Ser 


Ser 


He 


Val 


Ser 


Leu 


Tyr 


Lys 










545 








He 


Glu 


Met 


Val 


Glu 


Thr 


Gly 


He 










560 








He 


Pro 


His 


Gin 


Ala 


Gin 


Arg 


He 










575 








Pro 


Glu 


Asp 


Val 


Glu 


Ser 


He 


Arg 










590 








Tyr 


Gin 


Val 


Arg 


Gin 


Arg 


Thr 


Leu 










605 








Lys 


Pro 


Gin 


Thr 


Ser 


GlU 


Lys 


Gin 










620 








Arg 


Gin 


Asn 


Thr 


Leu 


Arg 


Glu 


Ser 










635 








Pro 


Trp 


Gly 


Lys 


Pro 


Ala 


Gly 


Thr 










650 








Tyr 


Pro 


Tyr 


Gly 


Asn 


Pro 


Gin 


Ser 










665 








Ala 


Gly 


Phe 


Ser 


Gly 


Lys 


Leu 


Pro 










680 








Ser 


Val 


Glu 


Ser 


Gly Gly Lys 


Tyr 










695 








Gin 


His 
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280 










285 


Phe 


Thr 

295 


Gin 


Asn 


He 


Ser 


Ala 
300 


Phe 


Ser 
310 


Tyr 


Leu 


Ser 


Tyr 


Leu 
315 


Gly 


He 
325 


Leu 


Ala 


He 


Thr 


Ala 
330 


Val 


Glu 
340 


Glu 


Asn 


Val 


Ser 


Gin 
345 


Phe 


Met 


Lys 


Met 


Leu 


Ser 


Ser 




355 








360 


Phe 


Met 


Gly Val 


Ser 


Thr 


Val 




370 










375 


Ala 


Phe 
385 


He 


Cys 


Phe 


Thr 


Leu 
390 


He 


Ser 
400 


Val 


Phe 


Ala 


Leu 


Phe 
405 


Phe 


Pro 
415 


Phe 


Ser 


He 


Lys 


Asp 
420 


Val 


Arg 


Gly Ala 


Gly 


Ser 


Phe 




43 0 










435 


Ser 


Leu 
445 


Phe 


Pro 


Arg 


Lys 


Lys 
450 


Val 


He 


Tyr 


Phe 


Thr 


Val 


Phe 




460 








465 


Leu 


Val 


Arg 


Tyr 


Leu Asp 


Val 




475 










480 


He 


Asn 
490 


Glu 


Glu 


Leu 


His 


He 
495 


Gly 


He 
505 


Glu 


Asp 


Val 


Cys 


Gly 
510 


Asp 


Lys 
520 


Phe 


Lys 


Lys 


Phe 


Asp 
525 


He 


Arg 
535 


Lys 


Asn 


Leu 


Pro 


Lys 
540 


Lys 


Leu 
550 


Glu 


Met 


Lys 


Gin 


Ala 
555 


Leu 


Ser 
565 


Ser 


Thr 


Ala 


Phe 


Ser 
570 


Gin 


Gly 
580 


He 


Lys 


Arg 


Leu 


Ser 
585 


Asp 


He 
595 


Leu 


Thr 


Ser 


Asn 


Met 
600 


Ser 


Tyr 
610 


Asn 


Lys 


Tyr 


Asn 


Leu 
615 


Ala 


Lys 
625 


Glu 


He 


Leu 


He 


Arg 
63 0 


Met 


Arg 
640 


Lys 


Gly 


His 


Ser 


Leu 
645 


Lys 


Asn 
655 


He 


Arg 


Tyr 


Leu 


Ser 
660 


Ala 


Gly 
670 


Arg 


Asp 


Thr 


Arg 


Ala 
675 


Thr 


Trp 
685 


Leu 


Leu 


Cys 


Cys 


Phe 
690 


Leu 


Gly 
700 


Val 


Trp 


Ala 


Lys 


Arg 
705 
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<223> Incyte ID 


No: 


<400> 22 




Met 


Gly 


Gly Phe 


Leu 


1 






5 


Gin 


Lys 


Leu Leu 


Pro 








20 


Gin 


His 


Leu Asp 


Lys 








35 


Glu 


Ser 


Pro Leu 


Leu 








50 


Leu 


Arg 


Gin Leu 


Leu 








65 


Gly 


Ala 


Leu Gly 


Glu 








80 


Asn 


Leu 


Glu Ala 


Ala 








95 


Val 


Phe 


Glu Pro 


Thr 








110 


Leu 


His 


He Ala 


Val 








125 


Leu 


Leu 


Thr Arg 


Arg 








140 


Ala 


Phe 


Arg Arg 


Ser 








155 


Pro 


Leu 


Ser Phe 


Ala 








170 


Leu 


Leu 


He Glu 


His 








185 


Gly 


Asn 


Thr Val 


Leu 






200 


Phe 


Ala 


Cys Gin 


Met 






215 


Gly 


Asp 


His Leu 


Gin 








230 


Leu 


Thr 


Pro Phe 


Lys 








245 


Phe 


Gin 


His Leu 


Met 








260 


Gly 


Pro 


Leu Thr 


Ser 








275 


Trp 


Gly Glu Glu 


Leu 








290 


Lys 


Arg 


Glu Ala 


Arg 








305 


Leu 


Val 


Ser Phe 


Lys 








320 


He 


Leu 


Ala Ala 


Leu 








335 


Cys 


Cys 


Val Tyr 


Arg 








350 


His 


Ser 


Arg Asp 


He 








365 


Ala 


Tyr 


Glu Thr 


Arg 








380 


Val 


Ser 


He Val 


Gly 








395 


Asp 


He 


Phe Arg 


Val 








410 


Leu 


Gly 


Gly Pro 


Phe 








425 


Val 


Leu 


Val Thr 


Met 








440 


Val 


Val 


Pro Met 


Ser 








455 


Met 


Tyr 


Phe Thr 


Arg 








470 



7477852CD1 



Pro 


Lys 


Ala 


Glu 


Gly 










10 


Ser 


Phe 


Leu 


Val 


Arg 










25 


Leu 


His 


Met 


Leu 


Gin 










40 


Arg Ala 


Ser 


Lys 


Glu 










55 


Leu 


Asp 


Cys 


Thr 


Cys 










70 


Thr 


Ala 


Leu 


His 


He 










85 


Leu 


Val 


Leu 


Met 


Glu 










100 


Thr 


Cys 


Glu 


Ala 


Phe 










115 


Val 


Asn 


Gin 


Asn 


Val 










130 


Ala 


Ser 


Val 


Ser 


Ala 










145 


Pro 


Arg 


Asn 


Leu 


He 










160 


Ala 


Cys 


Val 


Asn 


Ser 










175 


Gly Ala 


Asp 


He 


Arg 










190 


His 


He 


Leu 


He 


Leu 










205 


Tyr 


Asn 


Leu 


Leu 


Leu 








220 


Pro 


Leu 


Asp 


Leu 


Val 










235 


Leu 


Ala 


Gly Val 


Glu 










250 


Gin 


Lys 


Arg 


Arg 


His 










265 


He 


Leu 


Tyr Asp 


Leu 










280 


Ser 


Phe 


Leu 


Glu 


Leu 










295 


Gin 


He 


Leu 


Glu 


Gin 










310 


Trp 


Asn 


Lys Tyr 


Gly 










325 


Tyr 


Leu 


Leu 


Tyr 


Met 










340 


Pro 


Leu 


Lys 


Phe 


Arg 










355 


Thr 


He 


Leu 


Gin 


Gin 










370 


Glu 


Asp 


He 


He 


Arg 










385 


Ala 


Val 


He 


He 


Leu 










400 


Gly Ala 


Ser 


Arg 


Tyr 










415 


His 


Val 


He 


Met 


He 










430 


Val 


Met 


Arg 


Leu 


Thr 










445 


Phe 


Ala 


Leu 


Val 


Leu 










460 


Gly 


Phe 


Gin 


Met 


Leu 










475 



Pro 


Gly 


Ser 


Gin 


Leu 








15 


Glu 


Gin 


Asp 


Trp 


Asp 










30 


Gin 


Lys 


Arg 


He 


Leu 










45 


Asn 


Asp 


Leu 


Ser 


Val 










60 


Asp 


Val 


Arg 


Gin 


Arg 










75 


Ala 


Ala 


Leu 


Tyr 


Asp 










90 


Ala 


Ala 


Pro 


Glu 


Leu 










105 


Ala Gly Gin 


Thr 


Ala 










120 


Asn 


Leu 


Val 


Arg 


Ala 










135 


Arg 


Ala 


Thr 


Gly 


Thr 










150 


Tyr 


Phe 


Gly Glu 


His 










165 


Glu 


Glu 


He 


Val 


Arg 










180 


Ala 


Gin 


Asp 


Ser 


Leu 










195 


Gin 


Pro 


Asn 


Lys 


Thr 










210 


Ser 


Tyr 


Asp 


Gly 


His 










225 


Pro 


Asn 


His 


Gin 


Gly 










240 


Gly Asn 


Thr 


Val 


Met 










255 


He 


Gin 


Trp 


Thr 


Tyr 










270 


Thr 


Glu 


He 


Asp 


Ser 










285 


Val 


Val 


Ser 


Ser 


Asp 










300 


Thr 


Pro 


Val 


Lys 


Glu 










315 


Arg 


Pro 


Tyr 


Phe 


Cys 










330 


He 


Cys 


Phe 


Thr 


Thr 








345 


Gly Gly Asn Arg 


Thr 










360 


Lys 


Leu 


Leu 


Gin 


Glu 










375 


Leu 


Val 


Gly 


Glu 


Leu 










390 


Leu 


Leu 


Glu 


He 


Pro 










405 


Phe Gly Lys 


Thr 


He 










420 


Thr 


Tyr 


Ala 


Ser 


Leu 








435 


Asn Thr Asn Gly 


Glu 










450 


Gly Trp 


Cys 


Ser 


Val 










465 


Gly 


Pro 


Phe 


Thr 


He 










480 
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Met 


He 


Gin 


Lys 


Met 
485 


He 


Phe 


Gly 


Asp 


Leu 
490 


Met 


Arg 


Phe 


Cys 


Trp 
495 


Leu 


Met 


Ala 


Val 


Val 
500 


He 


Leu 


Gly 


Phe 


Ala 
505 


Ser 


Ala 


Phe 


Tyr 


He 
510 


He 


Phe 


Gin 


Thr 


Glu 


Asp 


Pro 


Thr 


Ser 


Leu 


Gly Gin 


Phe 


Tyr 


Asp 










515 










520 










525 


Tyr 


Pro 


Met 


Ala 


Leu 


Phe 


Thr 


Thr 


Phe 


Glu 


Leu 


Phe 


Leu 


Thr 


Val 








530 










535 










540 


He 


Asp 


Ala 


Pro 


Ala 
545 


Asn 


Tyr 


Asp 


Val 


Asp 
550 


Leu 


Pro 


Phe 


Met 


Phe 
555 


Ser 


He 


Val 


Asn 


Phe 
560 


Ala 


Phe 


Ala 


He 


He 
565 


Ala 


Thr 


Leu 


Leu 


Met 
570 


Leu 


Asn 


Leu 


Phe 


He 
575 


Ala 


Met 


Met 


Gly 


Asp 
580 


Thr 


His 


Trp 


Arg 


Val 
585 


Ala 


Gin 


GlU 


Arg 


Asp 
590 


Glu 


Leu 


Trp 


Arg 


Ala 
595 


Gin 


Val 


Val 


Ala 


Thr 
600 


Thr 


Val 


Met 


Leu 


Glu 
605 


Arg 


Lys 


Leu 


Pro 


Arg 
610 


Cys 


Leu 


Trp 


Pro 


Arg 
615 


Ser 


Gly 


He 


Cys 


Gly 
620 


Cys 


Glu 


Phe 


Gly 


Leu 
625 


Gly 


Asp 


Arg 


Trp 


Phe 
63 0 


Leu 


Arg 


Val 


Glu 


Asn 
635 


His 


Asn 


Asp 


Gin 


Asn 
640 


Pro 


Leu 


Arg 


Val 


Leu 
645 


Arg 


Tyr 


Val 


Glu 


Val 
650 


Phe 


Lys 


Asn 


Ser 


Asp 
655 


Lys 


Glu 


Asp 


Asp 


Gin 
660 


Glu 


His 


Pro 


Ser 


Glu 
665 


Lys 


Gin 


Pro 


Ser 


Gly 
670 


Ala 


Glu 


Ser 


Gly 


Thr 
675 


Leu 


Ala 


Arg 


Ala 


Ser 
680 


Leu 


Ala 


Leu 


Pro 


Thr 
685 


Ser 


Ser 


Leu 


Ser 


Arg 
690 


Thr 


Ala 


Ser 


Gin 


Ser 
695 


Ser 


Ser 


His 


Arg 


Gly 
700 


Trp 


Glu 


He 


Leu 


Arg 
705 


Gin 


Asn 


Thr 


Leu 


Gly 
710 


His 


Leu 


Asn 


Leu 


Gly 
715 


Leu 


Asn 


Leu 


Ser 


GlU 
720 


Gly 


Asp 


Gly Glu 


Glu 


Val 


Tyr 


His 


Phe 















725 
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<400> 23 



Met 


Ala 


Thr 


Lys 


Pro 


Thr 


Glu Pro 


Val 


Thr 


He 


Leu 


Ser Leu 


Arg 


1 






5 








10 








15 


Lys 


Leu 


Ser 


Leu 


Gly 


Thr 


Ala Glu 


Pro 


Gin 


Val 


Lys 


Glu Pro 


Lys 










20 








25 








30 


Thr 


Phe 


Thr 


Val 


Glu 


Asp 


Ala Val 


Glu 


Thr 


He 


Gly 


Phe Gly 


Arg 










35 








40 








45 


Phe 


His 


He 


Ala 


Leu 


Phe 


Leu He 


Met 


Gly 


Ser 


Thr Gly Val 


Val 










50 








55 








60 


Glu 


Ala 


Met 


Glu 


He 


Met 


Leu He 


Ala 


Val 


Val 


Ser 


Pro Val 


He 










65 








70 








75 


Arg 


Cys 


Glu 


Trp 


Gin 


Leu 


Glu Asn 


Trp 


Gin 


Val 


Ala 


Leu Val 


Thr 










80 








85 








90 


Thr 


Met 


Val 


Phe 


Phe 


Gly Tyr Met 


Val 


Phe 


Ser 


He 


Leu Phe 


Gly 










95 








100 








105 


Leu 


Leu 


Ala 


Asp 


Arg 


Tyr 


Gly Arg 


Trp 


Lys 


He 


Leu 


Leu He 


Ser 










110 








115 








120 


Phe 


Leu 


Trp 


Gly 


Ala 


Tyr 


Phe Ser 


Leu 


Leu 


Thr 


Ser 


Phe Ala 


Pro 










125 








130 








135 


Ser 


Tyr 


He 


Trp 


Phe 


Val 


Phe Leu 


Arg 


Thr 


Met 


Val 


Gly Cys 


Gly 










140 








145 








150 


Val 


Ser 


Gly His 


Ser 


Gin 


Gly Leu 


He 


He 


Lys 


Thr 


Glu Phe 


Leu 










155 








160 








165 



37/87 



BNSDOCID: <WO 0204520A2_I_> 



WO 02/04520 



PCT/US01/21448 



Pro 


Thr 


Lys 


Tyr 


Arg 


Gly 


Tyr 


Met 


Leu 


Pro 


Leu 


Ser 


Gin 


Val 


Phe 








170 










175 










180 


Trp 


Leu 


Ala 


Gly 


Ser 


Leu 


Leu 


He 


He 


Gly 


Leu 


Ala 


Ser 


Val 


He 








185 










190 










195 


He 


Pro 


Thr 


He 


Gly 
200 


Trp 


Arg 


Trp 


Leu 


He 
205 


Arg 


Val 


Ala 


Ser 


He 
210 


Pro 


Gly 


He 


He 


Leu 
215 


He 


Val 


Ala 


Phe 


Lys 
220 


Phe 


He 


Pro 


Glu 


Ser 
225 


Ala 


Arg 


Phe 


Asn 


Val 


Ser 


Thr 


Gly 


Asn 


Thr 


Arg Ala Ala 


Leu 


Ala 










230 










235 










240 


Thr 


Leu 


Glu 


Arg 


Val 
245 


Ala 


Lys 


Met 


Asn 


Arg 
250 


Ser 


Val 


Met 


Pro 


Glu 
255 


Gly 


Lys 


Leu 


Val 


Glu 


Pro 


Val 


Leu 


Glu 


Lys 


Arg Gly Arg 


Phe 


Ala 










260 










265 










270 


Asp 


Leu 


Leu 


Asp 


Ala 
275 


Lys 


Tyr 


Leu 


Arg 


Thr 
280 


Thr 


Leu 


Gin 


He 


Trp 
285 


Val 


He 


Trp 


Leu 


Gly 


He 


Ser 


Phe 


Ala 


Tyr 


Tyr 


Gly Val 


He 


Leu 










290 










295 










300 


Ala 


Ser 


Ala 


Glu 


Leu 
305 


Leu 


Glu 


Arg 


Asp 


Leu 
310 


Val 


Cys 


Gly 


Ser 


Lys 
315 


Ser 


Asp 


Ser 


Ala 


Val 


Val 


Val 


Thr 


Gly Gly 


Asp 


Ser 


Gly Glu 


Ser 










320 










325 










330 


Gin 


Ser 


Pro 


c ys 


Tyr 
335 


Cys 


His 


Met 


Phe 


Ala 
340 


Pro 


Ser 


Asp 


Tyr 


Arg 
345 


Thr 


Met 


He 


He 


Ser 
350 


Thr 


He 


Gly 


Glu 


He 
355 


Ala 


Leu 


Asn 


Pro 


Leu 
360 


Asn 


He 


Leu 


Gly 


He 


Asn 


Phe 


Leu 


Gly Arg 


Arg 


Leu 


Ser 


Leu 


Ser 










365 










370 










375 


He 


Thr 


Met 


Gly 


Cys 


Thr 


Ala 


Leu 


Phe 


Cys 


Leu 


Leu 


Leu 


Asn 


He 








380 










385 










390 


Cys 


Thr 


Ser 


Ser 


Ala 
395 


Gly 


Leu 


He 


Gly 


Phe 
400 


Leu 


Phe 


Met 


Leu 


Arg 
405 


Ala 


Leu 


Val 


Ala 


Ala 
410 


Asn 


Phe 


Asn 


Thr 


Val 
415 


Tyr 


He 


Tyr 


Thr 


Ala 
420 


Glu 


Val 


Tyr 


Pro 


Thr 


Thr 


Met 


Arg 


Ala 


Leu 


Gly Met 


Gly Thr 


Ser 








425 










430 










435 


Gly 


Ser 


Leu 


Cys 


Arg 
440 


He 


Gly 


Ala 


Met 


Val 
445 


Ala 


Pro 


Phe 


He 


Ser 
450 


Gin 


Val 


Leu 


Met 


Ser 
455 


Ala 


Ser 


He 


Leu 


Gly 
460 


Ala 


Leu 


Cys 


Leu 


Phe 
465 


Ser 


Ser 


Val 


Cys 


Val 


Val 


Cys 


Ala 


He 


Ser 


Ala 


Phe 


Thr 


Leu 


Pro 








470 










475 










480 


He 


Glu 


Thr 


Lys 


Gly 
485 


Arg 


Ala 


Leu 


Gin 


Gin 
490 


He 


Lys 









<210> 24 
<211> 1494 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3874406CD1 

<400> 24 



Met 


Asn 


Met 


Lys Gin 


Lys 


Ser 


Val 


Tyr 


Gin 


Gin 


Thr 


Lys 


Ala 


Leu 


1 






5 










10 










15 


Leu 


Cys 


Lys 


Asn Phe 
20 


Leu 


Lys 


Lys 


Trp 


Arg 
25 


Met 


Lys 


Arg 


Glu 


Ser 
30 


Leu 


Leu 


Glu 


Trp Gly 


Leu 


Ser 


He 


Leu 


Leu 


Gly Leu 


Cys 


He 


Ala 








35 










40 










45 


Leu 


Phe 


Ser 


Ser Ser 


Met 


Arg 


Asn 


Val 


Gin 


Phe 


Pro 


Gly Met 


Ala 








50 








55 










60 


Pro 


Gin 


Asn 


Leu Gly 


Arg Val 


Asp 


Lys 


Phe 


Asn 


Ser 


Ser 


Ser 


Leu 








65 










70 










75 


Met 


Val 


Val 


Tyr Thr 
80 


Pro 


He 


Ser 


Asn 


Leu 
85 


Thr 


Gin 


Gin 


He 


Met 
90 



38/87 



,0204520A2J_> 



WO 02/04520 PCT/US01/21448 



Asn 


Lys 


Thr 


Ala 


Leu 


Ala 


Pro 


Leu 


Leu 


Lys 


Gly 


Thr 


Ser 


Val 


lie 










95 










100 








105 


Gly 


Ala 


Pro 


Asn 


Lys 
110 


Thr 


His 


Met 


Asp 


Glu 
115 


He 


Leu 


Leu 


Glu 


Asn 
120 


Leu 


Pro 


Tyr 


Ala 


Met 
125 


Gly 


He 


He 


Phe 


Asn 
130 


Glu 


Thr 


Phe 


Ser 


Tyr 
135 


Lys 


Leu 


lie 


Phe 


Phe 
140 


Gin 


Gly 


Tyr 


Asn 


Ser 
145 


Pro 


Leu 


Trp 


Lys 


Glu 
150 


Asp 


Phe 


Ser 


Ala 


His 
155 


Cys 


Trp 


Asp 


Gly 


Tyr 
160 


Gly 


Glu 


Phe 


Ser 


Cys 
165 


Thr 


Leu 


Thr 


Lys 


Tyr 
170 


Trp 


Asn 


Arg 


Gly 


Phe 
175 


Val 


Ala 


Leu 


Gin 


Thr 
180 


Ala 


lie 


Asn 


Thr 


Ala 
185 


He 


He 


Glu 


Val 


Ala 
190 


Leu 


Val 


Phe 


Leu 


Met 
195 


Ser 


Val 


Leu 


Leu 


Lys 
200 


Lys 


Ala 


Val 


Leu 


Thr 

205 


Asn 


Leu 


Val 


Val 


Phe 
210 


Leu 


Leu 


Thr 


Leu 


Phe 
215 


Trp 


Gly 


Cys 


Leu 


Gly 
220 


Phe 


Thr 


Val 


Phe 


Tyr 
225 


Glu 


Gin 


Leu 


Pro 


Ser 
230 


Ser 


Leu 


Glu 


Trp 


lie 
235 


Leu 


Asn 


He 


Cys 


Ser 
240 


Pro 


Phe 


Ala 


Phe 


Thr 
245 


Thr 


Gly 


Met 


He 


Gin 
250 


He 


He 


Lys 


Leu 


Asp 
255 


Tyr 


Asn 


Leu 


Asn 


Gly 
260 


Val 


He 


Phe 


Pro 


Asp 
265 


Pro 


Ser 


Gly 


Asp 


Ser 
270 


Tyr 


Thr 


Met 


He 


Ala 
275 


Thr 


Phe 


Ser 


Met 


Leu 
280 


Leu 


Leu 


Asp 


Gly 


Leu 
285 


lie 


Tyr 


Leu 


Leu 


Leu 
290 


Ala 


Leu 


Tyr 


Phe 


Asp 
295 


Lys 


He 


Leu 


Pro 


Tyr 
300 


Gly 


Asp 


Glu 


Arg 


His 
305 


Tyr 


Ser 


Pro 


Leu 


Phe 
310 


Phe 


Leu 


Asn 


Ser 


Ser 
315 


Ser 


Cys 


Phe 


Gin 


His 
320 


Gin 


Arg 


Thr 


Asn 


Ala 
325 


Lys 


Val 


He 


Glu 


Lys 
330 


Glu 


lie 


Asp 


Ala 


Glu 


His 


Pro 


Ser 


Asp 


Asp 


Tyr 


Phe 


Glu 


Pro 


Val 










335 








340 










345 


Ala 


Pro 


Glu 


Phe 


Gin 
350 


Gly 


Lys 


Glu 


Ala 


He 
355 


Arg 


He 


Arg 


Asn 


Val 
360 


Lys 


Lys 


Glu 


Tyr 


Lys 
365 


Gly 


Lys 


Ser 


Gly 


Lys 
370 


Val 


Glu 


Ala 


Leu 


Lys 
375 


Gly 


Leu 


Leu 


Phe 


Asp 
380 


He 


Tyr 


Glu 


Gly 


Gin 
385 


He 


Thr 


Ala 


He 


Leu 
390 


Gly 


His 


Ser 


Gly 


Ala 
395 


Gly 


Lys 


Ser 


Ser 


Leu 
400 


Leu 


Asn 


He 


Leu 


Asn 
405 


Gly 


Leu 


Ser 


Val 


Pro 
410 


Thr 


Glu 


Gly 


Ser 


Val 
415 


Thr 


He 


Tyr 


Asn 


Lys 
420 


Asn 


Leu 


Ser 


Glu 


Met 
425 


Gin 


Asp 


Leu 


Glu 


Glu 
43 0 


He 


Arg 


Lys 


He 


Thr 

435 


Gly 


Val 


Cys 


Pro 


Gin 
440 


Phe 


Asn 


Val 


Gin 


Phe 
445 


Asp 


lie 


Leu 


Thr 


Val 
450 


Lys 


Glu 


Asn 


Leu 


Ser 
455 


Leu 


Phe 


Ala 


Lys 


He 
460 


Lys 


Gly 


He 


His 


Leu 
465 


Lys 


Glu 


Val 


Glu 


Gin 
470 


Glu 


Val 


Gin 


Arg 


He 
475 


Leu 


Leu 


GlU 


Leu 


Asp 
480 


Met 


Gin 


Asn 


He 


Gin 
485 


Asp 


Asn 


Leu 


Ala 


Lys 
490 


His 


Leu 


Ser 


Glu 


Gly 
495 


Gin 


Lys 


Arg 


Lys 


Leu 
500 


Thr 


Phe 


Gly 


He 


Thr 
505 


He 


Leu 


Gly 


Asp 


Pro 
510 


Gin 


lie 


Leu 


Leu 


Leu 
515 


Asp 


Glu 


Pro 


Thr 


Thr 

520 


Gly 


Leu 


Asp 


Pro 


Phe 
525 


Ser 


Arg 


Asp 


Gin 


Val 
530 


Trp 


Ser 


Leu 


Leu 


Arg 
535 


Glu 


Arg 


Arg 


Ala 


Asp 
540 


His 


Val 


lie 


Leu 


Phe 
545 


Ser 


Thr 


Gin 


Ser 


Met 
550 


Asp 


Glu 


Ala 


Asp 


He 
555 


Leu 


Ala 


Asp 


Arg 


Lys 
560 


Val 


He 


Met 


Ser 


Asn 
565 


Gly 


Arg 


Leu 


Lys 


Cys 
570 


Ala 


Gly 


Ser 


Ser 


He 
575 


Phe 


Leu 


Lys 


Arg 


Arg 
580 


Trp 


Gly 


Leu 


Gly 


Tyr 
585 


His 


Leu 


Ser 


Leu 


His 


Arg 


Asn 


Glu 


He 


Cys 


Asn 


Pro 


Glu 


Gin 


He 



39/87 



BNSDOCID: <WO 0204520A2_I_> 



WO 02/04520 



PCT/US01/21448 



590 



Thr 


Ser 


Phe 


He 


Thr 
605 


His 


His 


Glu 


Asn 


Lys 


Glu 


Lys 
620 


Leu 


Val 


Asn 


Thr 


Phe 


Pro 


Asp 
635 


Leu 


Phe 


Gin 


Gly 


Val 


Thr 


Gly 
650 


Tyr 


Asp 


Val 


Phe 


Met 


Lys 


Leu 
665 


Glu 


Gly 


Glu 


Gin 


Val 


Glu 


Met 
680 


He 


Arg 


Glu 


Leu 


Ala 


His 


Ser 
695 


Ser 


Phe 


Asp 


Met 


Gly 


Leu 


Trp 
710 


Arg 


Met 


Arg 


Phe 


Leu 


Lys 


Leu 
725 


Lys 


Arg 


Leu 


Leu 


Val 


Phe 


Gly 
740 


He 


Ala 


He 


He 


Tyr 


Ala 


Met 


Leu 


Asn 








755 






Asn 


Glu 


Leu 


Tyr 


Phe 
770 


Leu 


Ser 


Arg 


Thr 


Ser 


Leu 


Leu 


He 


He 








785 






Asp 


Phe 


He 


Lys 


Ser 
800 


Leu 


Lys 


Asp 


Asp 


Phe 


Glu 


Asn 
815 


Arg 


Asn 


Gly 


Ala 


He 


He 


Val 
830 


Ser 


Gly 


Val 


Val 


Cys 


Asn 


Thr 
845 


Lys 


Arg 


Asn 


He 


He 


Ser 


Asn 
860 


Gly 


Leu 


His 


He 


Arg 


He 


Glu 
875 


Ser 


Ser 


Leu 


Trp 


Thr 


Gly 


Leu 
890 


Pro 


Asp 


Leu 


Cys 


Ser 


He 


Ser 
905 


Pro 


Tyr 


Tyr 


Lys 


Lys 


Asn 


Ala 
920 


Lys 


Ser 


Thr 


Ser 


Ala 


Tyr 


Trp 
935 


Cys 


Gly 


Phe 


He 


Leu 


He 


Leu 
950 


Leu 


Leu 


Asn 


Met 


Gin 


Tyr 


Leu 
965 


Leu 


He 


Val 


He 


Val 


Thr 


Pro 
980 


Gly 


Tyr 


Tyr 


Met 


He 


Ser 


Phe 


He 


Phe 








995 






Leu 


Trp 


Ser 


Phe 


Tyr 
1010 


Phe 


Phe 


He 


Thr 


Leu 


He 


Asn 
1025 


His 


Phe 


Met 


Val 


Leu 


Val 


Pro 
1040 


Ser 


Tyr 


Leu 


Glu 


Val 


Arg 


Asp 
1055 


Gin 


Glu 


Asn 


Phe 


Glu 


Leu 


Ser 
1070 


Ala 


Thr 


Tyr 


Phe 


Gin 


Thr 


Leu 


Leu 


Phe 



1085 







595 










600 


He 


Pro 


Asp 


Ala 


Lys 


Leu 


Lys 


Thr 






610 










615 


Tyr 


Thr 


Leu 


Pro 


Leu 


Glu 


Arg 


Thr 






625 










630 


Ser 


Asp 


Leu 


Asp 


Lys 


Cys 


Ser 


Asp 






640 










645 


He 


Ser 


Met 


Ser 


Thr 


Leu 


Asn 


Glu 






655 










660 


Gin 


Ser 


Thr 


He 


Glu 


Gin 


Asp 


Phe 






670 










675 


Asp 


Ser 


Glu 


Ser 


Leu 


Asn 


Glu 


Met 




685 










690 


Ser 


Glu 


Met 


Gin 


Thr 


Ala 


Val 


Ser 






700 










705 


Gin 


Val 


Phe 


Ala 


Met 


Ala 


Arg 


Leu 






715 










720 


Gin 


Thr 


Lys 


Val 


Leu 


Leu 


Thr 


Leu 






730 










735 


He 


Phe 


Pro 


Leu 


He 


Val 


Glu 


Asn 






745 










750 


Glu 


Lys 


He 


Asp 


Trp 


Glu 


Phe 


Lys 






760 










765 


Pro 


Gly 


Gin 


Leu 


Pro 


Gin 


Glu 


Pro 




775 










780 


Asn 


Asn 


Thr 


Glu 


Ser 


Asn 


He 


Glu 






790 










795 


His 


Gin 


Asn 


He 


Leu 


Leu 


Glu 


Val 






805 










810 


Gly 


Thr 


Asp Gly Leu 


Ser 


Tyr 


Asn 






820 










825 


Lys 


Gin 


Lys 


Asp 


Tyr 


Arg 


Phe 


Ser 






835 










840 


Leu 


His 


Cys 


Phe 


Pro 


He 


Leu 


Met 






850 










855 


Leu 


Gin 


Met 


Phe 


Asn 


His 


Thr 


Gin 






865 










870 


Pro 


Phe 


Pro 


Leu 


Ser 


His 


He 


Gly 






880 










885 


Gly 


Ser 


Phe 


Phe 


Leu 


Phe 


Leu 


Val 






895 










900 


He 


Thr 


Met 


Gly 


Ser 


He 


Ser 


Asp 






910 










915 


Gin 


Leu 


Trp 


He 


Ser 


Gly 


Leu 


Tyr 






925 










930 


Gin 


Ala 


Leu 


Val 


Asp 


Val 


Ser 


Phe 






940 










945 


Met 


Tyr 


Leu 


He 


Phe 


Tyr 


He 


Glu 




955 










960 


Thr 


Ser 


Gin 


He 


Val 


Phe 


Ala 


Leu 






970 










975 


Ala 


Ala 


Ser 


Leu 


Val 


Phe 


Phe 


He 






985 










990 


Arg 


Lys 


Arg 


Arg 


Lys 


Asn 


Ser 


Gly 






1000 










1005 


Phe 


Ala 


Ser 


Thr 


He 


Met 


Phe 


Ser 






1015 










1020 


Asp 


Leu 


Ser 


He 


Leu 


He 


Thr 


Thr 




1030 










1035 


Thr 


Leu 


Leu 


Gly Phe 


Lys 


Thr 


Phe 






1045 










1050 


His 


Tyr 


Arg 


Glu 


Phe 


Pro 


Glu 


Ala 






1060 










1065 


Asp 


Phe 


Leu 


Val 


Cys 


Phe 


He 


Pro 






1075 










1080 


Val 


Phe 


Val 


Leu Arg 


cys 


Met 


Glu 






1090 










1095 



40/87 



0204520A2_I_> 



WO 02/04520 



PCT/USO 1/2 1448 



Leu 


Lys 


Cys 


Gly Lys 


Lys 


Arg 


Met 


Arg Lys 


Asp 


Pro 


Val 


Phe Arg 








1100 








1105 








1110 


lie 


Ser 


Pro 


Gin Ser 


Arg 


Asp 


Ala 


Lys Pro 


Asn 


Pro 


Glu 


Glu Pro 








1115 








1120 








1125 


He 


Asp 


Glu 


Asp Glu 


Asp 


He 


Gin 


Thr Glu 


Arg 


He 


Arg 


Thr Val 








1130 








1135 








1140 


Thr 


Ala 


Leu 


Thr Thr 


Ser 


He 


Leu Asp Glu 


Lys 


Pro 


Val 


He He 








1145 








1150 








1155 


Ala 


Ser 


Cys 


Leu His 


Lys 


Glu 


Tyr 


Ala Gly Gin 


Lys 


Lys 


Ser Cys 








1160 








1165 








1170 


Phe 


Ser 


Lys 


Arg Lys 


Lys 


Lys 


He 


Ala Ala 


Arg 


Asn 


He 


Ser Phe 








1175 








1180 








1185 


Cys 


Val 


Gin 


Glu Gly 


Glu 


He 


Leu 


Gly Leu 


Leu 


Gly 


Pro 


ser Gly 








1190 








1195 








1200 


Ala 


Gly 


Lys 


Ser Ser 


Ser 


He 


Arg 


Met He 


Ser 


Gly 


He 


Thr Lys 








1205 








1210 








1215 


Pro 


Thr 


Ala 


Gly Glu 


Val 


Glu 


Leu 


Lys Gly 


Cys 


Ser 


Ser 


Val Leu 








1220 








1225 








1230 


Gly 


His 


Leu 


Gly Tyr 


Cys 


Pro 


Gin 


Glu Asn 


Val 


Leu 


Trp 


Pro Met 








1235 








1240 








1245 


Leu 


Thr 


Leu 


Arg Glu 


His 


Leu 


Glu 


Val Tyr 


Ala 


Ala 


Val 


Lys Gly 








1250 








1255 








1260 


Leu 


Arg 


Glu 


Ala Asp 


Ala 


Arg 


Leu 


Ala He 


Ala 


Arg 


Leu 


Val Ser 








1265 








1270 








1275 


Ala 


Phe 


Lys 


Leu His 


Glu 


Gin 


Leu 


Asn Val 


Pro 


Val 


Gin 


Lys Leu 








1280 








1285 








1290 


Thr 


Ala 


Gly 


He Thr 


Arg 


Lys 


Leu 


Cys Phe 


Val 


Leu 


Ser 


Leu Leu 








1295 








1300 








1305 


Gly 


Asn 


Ser 


Pro Val 


Leu 


Leu 


Leu Asp Glu 


Pro 


Ser 


Thr 


Gly He 








1310 








1315 








1320 


Asp 


Pro 


Thr 


Gly Gin 


Gin 


Gin 


Met 


Trp Gin 


Ala 


He 


Gin 


Ala Val 








1325 








1330 








1335 


Val 


Lys 


Asn 


Thr Glu 


Arg 


Gly 


Val 


Leu Leu 


Thr 


Thr 


His 


Asn Leu 








1340 








1345 








1350 


Ala 


Glu 


Ala 


Glu Ala 


Leu 


Cys 


Asp 


Arg Val 


Ala 


He 


Met 


Val Ser 








1355 








1360 








1365 


Gly 


Arg 


Leu 


Arg Cys 


He 


Gly 


Ser 


He Gin 


His 


Leu 


Lys 


Asn Lys 








1370 








1375 








1380 


Leu 


Gly 


Lys 


Asp Tyr 


He 


Leu 


Glu 


Leu Lys 


Val 


Lys 


Glu 


Thr Ser 








1385 








1390 








1395 


Gin 


Val 


Thr 


Leu Val 


His 


Thr 


Glu 


He Leu 


Lys 


Leu 


Phe 


Pro Gin 








1400 








1405 








1410 


Ala 


Ala 


Gly 


Gin Gin 


Arg 


Tyr 


Ser 


Ser Leu 


Leu 


Thr 


Tyr 


Lys Leu 








1415 








1420 








1425 


Pro 


Val 


Ala 


Asp Val 


Tyr 


Pro 


Leu 


Ser Gin 


Thr 


Phe 


His 


Lys Leu 








1430 








1435 








1440 


Glu 


Ala 


Val 


Lys His 


Asn 


Phe 


Asn 


Leu Glu 


Glu 


Tyr 


Ser 


Leu Ser 








1445 








1450 








1455 


Gin 


Cys 


Thr 


Leu Glu 


Lys 


Val 


Phe 


Leu Glu 


Leu 


Ser 


Lys 


Glu Gin 








1460 








1465 


i 






1470 


Glu 


Val 


Gly 


Asn Phe 


Asp 


Glu 


Glu 


He Asp 


Thr 


Thr 


Met 


Arg Trp 








1475 








1480 








1485 


Lys 


Leu 


Leu 


Pro His 


Ser 


Asp 


Glu 


Pro 
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Thr 


Pro 


Gly 


Leu 


GlU 


Ala 


Val 


Pro 


Pro 


Val 


Ala 


Pro 


Pro 


Pro 


Ala 








20 










25 










30 


Thr 


Ala 


Ala 


Ser 


Gly 


Pro 


He 


Pro 


Lys 


Ser 


Gly 


Pro 


Glu 


Pro 


Lys 










35 










40 










45 


Arg 


Arg 


His 


Leu 


Gly 


Thr 


Leu 


Leu 


Gin 


Pro 


Thr 


Val 


Asn 


Lys 


Phe 










50 










55 










60 


Ser 


Leu 


Arg 


Val 


Phe 


Gly 


Ser 


His 


Lys 


Ala 


Val 


Glu 


He 


Glu 


Gin 










65 










70 










75 


Glu 


Arg 


Val 


Lys 


Ser 


Ala 


Gly 


Ala 


Trp 


He 


He 


His 


Pro 


Tyr 


Ser 








80 










85 










90 


Asp 


Phe 


Arg 


Phe 


Tyr 


Trp 


Asp 


Leu 


He 


Met 


Leu 


Leu 


Leu 


Met 


Val 










95 










100 










105 


Gly 


Asn 


Leu 


He 


Val 


Leu 


Pro 


Val 


Gly 


He 


Thr 


Phe 


Phe 


Lys 


Glu 








110 










115 










120 


Glu 


Asn 


Ser 


Pro 


Pro 


Trp 


He 


Val 


Phe 


Asn 


Val 


Leu 


Ser 


Asp 


Thr 










125 










130 










135 


Phe 


Phe 


Leu 


Leu 


Asp 


Leu 


Val 


Leu 


Asn 


Phe 


Arg 


Thr 


Gly 


He 


Val 










140 










145 










150 


Val 


Glu 


Glu 


Gly 


Ala 


Glu 


He 


Leu 


Leu 


Ala 


Pro 


Arg 


Ala 


He 


Arg 








155 










160 










165 


Thr 


Arg 


Tyr 


Leu 


Arg 


Thr 


Trp 


Phe 


Leu 


Val 


Asp 


Leu 


He 


Ser 


Ser 










170 










175 










180 


He 


Pro 


Val 


Asp 


Tyr 


He 


Phe 


Leu 


Val 


Val 


Glu 


Leu 


Glu 


Pro 


Arg 








185 










190 










195 


Leu 


Asp 


Ala 


Glu 


Val 


Tyr 


Lys 


Thr 


Ala 


Arg 


Ala 


Leu 


Arg 


He 


Val 








200 








205 










210 


Arg 


Phe 


Thr 


Lys 


He 


Leu 


Ser 


Leu 


Leu 


Arg 


Leu 


Leu 


Arg 


Leu 


Ser 






215 










220 










225 


Arg 


Leu 


He 


Arg 


Tyr 


He 


His 


Gin 


Trp 


Glu 


Glu 


He 


Phe 


His 


Met 






230 










235 










240 


Thr 


Tyr 


Asp 


Leu 


Ala 


Ser 


Ala 


Val 


Val 


Arg 


He 


Phe 


Asn 


Leu 


He 






245 










250 










255 


Gly 


Met 


Met 


Leu 


Leu 


Leu 


Cys 


His 


Trp 


Asp 


Gly 


Cys 


Leu 


Gin 


Phe 










260 










265 










270 


Leu 


Val 


Pro 


Met 


Leu 


Gin 


Asp 


Phe 


Pro 


Pro 


Asp 


Cys 


Trp 


Val 


Ser 










275 










280 










285 


He 


Asn 


His 


Met 


Val 


Asn 


His 


Ser 


Trp 


Gly 


Arg 


Gin 


Tyr 


Ser 


His 










290 








295 










300 


Ala 


Leu 


Phe 


Lys 


Ala 


Met 


Ser 


His 


Met 


Leu 


Cys 


He 


Gly 


Tyr 


Gly 










305 










310 










315 


Gin 


Gin 


Ala 


Pro 


Val 


Gly 


Met 


Pro 


Asp 


Val 


Trp 


Leu 


Thr 


Met 


Leu 










320 










325 










330 


Ser 


Met 


He 


Val 


Gly 


Ala 


Thr 


Cys 


Tyr 


Ala 


Met 


Phe 


He 


Gly 


His 










335 










340 










345 


Ala 


Thr 


Ala 


Leu 


He 


Gin 


Ser 


Leu 


Asp 


Ser 


Ser 


Arg Arg 


Gin 


Tyr 










350 










355 










360 


Gin 


Glu 


Lys 


Tyr 


Lys 


Gin 


Val 


Glu 


Gin 


Tyr 


Met 


Ser 


Phe 


His 


Lys 










365 










370 










375 


Leu 


Pro 


Ala Asp 


Thr 


Arg 


Gin 


Arg 


He 


His 


Glu 


Tyr 


Tyr 


Glu 


His 










380 










385 










390 


Arg 


Tyr 


Gin 


Gly 


Lys 


Met 


Phe 


Asp 


Glu 


Glu 


Ser 


He 


Leu 


Gly 


Glu 










395 










400 










405 


Leu 


Ser 


Glu 


Pro 


Leu 


Arg 


Glu 


Glu 


He 


He 


Asn 


Phe 


Thr 


Cys 


Arg 










410 










415 










420 


Gly 


Leu 


Val 


Ala 


His 


Met 


Pro 


Leu 


Phe 


Ala 


His 


Ala 


Asp 


Pro 


Ser 








425 










430 










435 


Phe 


Val 


Thr 


Ala 


Val 


Leu 


Thr 


Lys 


Leu 


Arg 


Phe 


Glu 


Val 


Phe 


Gin 










440 










445 










450 


Pro 


Gly 


Asp 


Leu 


Val 


Val 


Arg 


Glu 


Gly 


Ser 


Val 


Gly Arg 


Lys 


Met 










455 










460 










465 


Tyr 


Phe 


He 


Gin 


His 


Gly 


Leu 


Leu 


Ser 


Val 


Leu 


Ala Arg 


Gly 


Ala 








470 








475 










480 


Arg 


Asp 


Thr 


Arg 


Leu 


Thr 


Asp 


Gly 


Ser 


Tyr 


Phe 


Gly Glu 


He 


Cys 










485 










490 










495 


Leu 


Leu 


Thr 


Arg 


Gly 


Arg 


Arg 


Thr 


Ala 


Ser 


Val 


Arg Ala 


Asp 


Thr 










500 










505 










510 


Tyr 


Cys 


Arg 


Leu 


Tyr 


Ser 


Leu 


Ser 


Val 


Asp 


His 


Phe 


Asn 


Ala 


Val 



42/87 



0204520A2_I_> 



WO 02/04520 



PCT/US01/21448 



515 



Leu 


Glu 


Glu 


Phe 


Pro 
530 


Met 


Met 


Arg 


Met 


Asp 


Arg 


Leu 


Leu 
545 


Arg 


lie 


Gly 


Arg 


Lys 


Arg 


Ser 


Glu 
560 


Pro 


Ser 


Pro 


Glu 


Gin 


His 


Leu 


Val 
575 


Gin 


His 


Asp 


Arg 


Gly 


Arg 


Ala 


Pro 
590 


Ser 


Thr 


Gly 


Val 


Leu 


Trp 


Glu 


Pro 
605 


Leu 


Val 


His 


Val 


Thr 


Ser 


Asn 


Val 
620 


Ala 


He 


Ala 


Leu 


Pro 


Leu 


Ser 


Pro 
635 


Asp 


Ser 


Pro 


Ala 


Trp 


Arg 


Ser 


Ala 
650 


Gly 


Ser 


Pro 


Arg 


Ala 


Gly 


Pro 


Trp 
665 


Ala 


Ser 


Thr 


Ala 


Arg 


Thr 


Leu 


His 
680 


Ala 


Ser 


Leu 


Val 


Ser 


Leu 


Leu 


Gly 
695 


Pro 


Pro 


Pro 


Gly 


Pro 


Arg 


Gly 


Arg 
710 


Pro 


Leu 


Ser 


Gin 


Arg 


Ala 


Thr 


Gly 
725 


Asp 


Gly 


Ser 


Ser 


Glu 


Arg 


Leu 


Pro 


Pro 


Ser 


Gly 










740 






Thr 


Ala 


Gin 


Pro 


Pro 
755 


Arg 


Pro 


Pro 


Arg 


Gly 


Leu 


Gin 


Leu 


Ser 


Ala 


Asn 
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Met 


Ala 


Glu 


Gly 


Glu 


Arg 


Gly 


Ala 


1 








5 








Ala 


Trp 


Leu 


Ala 


Asp 
20 


Val 


Ala 


Leu 


Gin 


Gly 


Arg 


Arg 


Asp 
35 


Arg 


Gly 


Gly 


Gly 


Gly 


Ser 


Gly 


Val 
50 


Gly 


Asp 


Ser 


Pro 


Ser 


Ala 


Pro 


Pro 
65 


Ala 


Arg 


Arg 


His 


Ser 


Pro 


Pro 


Val 
80 


Leu 


Pro 


Leu 


Gly 


Gly 


Leu 


Thr 


Phe 
95 


Gly 


Tyr 


Glu 


Leu 


Leu 


Pro 


Leu 


Gin 
110 


Leu 


Asp 


Phe 


Glu 


Phe 


Leu 


Val 


Gly 
125 


Ser 


Leu 


Leu 


Leu 


Val 


Gly 


Gly 


Phe 


Leu 


lie 


Asp 










140 






He 


Leu 


Gly 


Ser 


Asn 


Leu 


Val 


Leu 





520 






525 


Arg 


Ala 


Phe 


Glu Thr Val 


Ala 




535 






540 


Lys 


Lys 


Asn 


Ser He Leu 


Gin 




550 






555 


Gly 


Ser 


Ser 


Gly Gly He 


Met 




565 






570 


Arg 


Asp 


Met 


Ala Arg Gly 


Val 




580 






585 


Ala 


Gin 


Leu 


Ser Gly Lys 


Pro 




595 






600 


Ala 


Pro 


Leu 


Gin Ala Ala 


Ala 




610 






615 


Leu 


Thr 


His 


Gin Arg Gly 


Pro 




625 






630 


Ala 


Thr 


Leu 


Leu Ala Arg 


Ser 




640 






645 


Ala 


Ser 


Pro 


Leu Val Pro 


Val 




655 






660 


Ser 


Arg 


Leu 


Pro Ala Pro 


Pro 




670 






675 


Ser 


Arg 


Ala 


Gly Arg" Ser 


Gin 




685 






690 


Gly 


Gly 


Gly Gly Arg Arg 


Leu 




700 






705 


Ala 


Ser 


Gin 


Pro Ser Leu 


Pro 




715 






720 


Pro 


Gly 


Arg 


Lys Gly Ser 


Gly 




730 






735 


Leu 


Leu 


Ala 


Lys Pro Pro 


Arg 




745 






750 


Val 


Pro 


Glu 


Pro Ala Thr 


Pro 




760 






765 


Met 










Asp 


Val 


Pro 


His Gly Leu 


Gly 




10 






15 


Ala 


Ala 


Leu Arg Ala Gly 


Gly 




25 






30 


Gly 


Gly 


Pro 


Glu Ser Leu 


Ser 




40 






45 


Gly 


Gly 


Gly 


Cys Ala Pro 


Gly 




55 






60 


Arg 


Val 


Pro 


Leu Ala Met 


Gly 




70 






75 


Cys 


Ala 


Ser 


Val Ser Leu 


Leu 




85 






90 


Leu 


Ala 


Val 


He Ser Gly 


Ala 




100 




105 


Gly 


Leu 


Ser 


Cys Leu Glu 


Gin 




115 






120 


Leu 


Gly 


Ala 


Leu Leu Ala 


Ser 




130 






135 


Cys 


Tyr 


Gly Arg Lys Gin 


Ala 




145 






150 


Leu 


Ala 


Gly 


Ser Leu Thr 


Leu 
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Gly 


Leu 


Ala 


Val 


Gly 


i lit: 


Val 


Ser 


Glu 


Leu 




Glu 


Leu 


Asn 


±y x. 


Phe 


Glv 


* -^iv 


LeU 


Phe 


Leu 




IXC 


Pro 
Jr J. w 


7\ v-rr 
nl y 


Jr JL \J 


jrii- y 


Met 


Arg 


Gly 


Gl -n 

OJ.ll 


JLJ tr H 




Phe 


Ser 


S er 


Ser 


val 
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Leu 




Cys 


Ala 


Sex* 


Phe 


Ala 




A C3TI 


Ala 






A on 
nop 


Gin 


Arg 


Glu 


•T J- *w> 


ni y 


Ser 


Ser 


Ala. 


Leu 




ax. y 


±L hf 


Phe 


Ser 


Phe 


lie 




Pro 


Ser 


Phe 


Asn 


Asp 


Leu. 


lie 


Gly 


Leu 


Thr 


Pro 


Glu 


Thr 


Gin 


Lys 


Arg 


Thr 


Gly 


He 





155 




Gly 


Ser 


Leu 




17 0 




Ala 


He 


Ser 




185 




Leu 


Val 


Gly 




200 




Ala 


Gly 


He 




215 




Ala 


Leu 


Ala 




230 




Ala 


Thr 


Ala 




245 




Pro 


Ala 


Gly 




260 




Leu 


Gin 


Gly 




275 




Tyr 


Ser 


Phe 




290 




Arg 


Thr 


Thr 




305 




Gly 


Gin 


Pro 




320 




Val 


Gly 


Phe 




335 




Leu 


Gly 


Ala 




350 




Val 


Asp 


Arg 




365 




Leu 


Met 


Ala 




380 




Val 


Pro 


Met 




395 




Thr 


Gly 


Gin 




410 




Ser 


Ser 


Leu 




425 




Pro 


He 


Leu 




440 




Gly 


Asp 


Pro 




455 




Pro 


Gly 


Pro 




470 




Thr 


Ala 


Leu 




485 




Gly 


Phe 


Gly 




500. 




Val 


Glu 


He 




515 




Trp 


Ala 


Ala 




530 




Gly 


Thr 


He 




545 




Ala 


Val 


Leu 




560 




Lys 


Gly 


Gin 




575 




Arg 


Phe 


Thr 




590 




Pro 


Tyr 


Ser 




605 





Ala Trp Leu 
Leu Ser Ser 
Pro Arg Gin 
Thr Val Gly 
Gly Thr Pro 
Pro Ala Val 
Thr Asp Glu 
Gly Glu Ala 
Leu Asp Leu 
Val Gly Leu 
Asn Val Leu 
His Gly Gly 
Val Lys Val 
Ala Gly Arg 
Leu Ser Val 
Asp Ser Gly 
Thr Gly Leu 
Pro Pro He 
Ser Thr Ala 
Ser Ala Pro 
Pro Leu Pro 
Leu Cys Leu 
Pro Val Thr 
Arg Gly Arg 
Asn Leu Phe 
Gly Leu Ser 
Gly Leu Gly 
Ser Leu Ala 
Leu Ser Phe 
Arg He Glu 



160 






Val 


Leu 


Gly 


175 






Met 


Ala 


Cys 


190 






Arg 


Gly 


Val 


205 






He 


Leu 


Leu 


220 






Trp 


Gly 


Trp 


235 






Leu 


Gin 


Ser 


250 






Thr 


Ala 


Thr 


265 






Pro 


Lys 


Leu 


280 






Phe 


Arg 


Ala 


295 






Gly 


Leu 


Val 


310 






Cys 


Tyr 


Ala 


325 






Ser 


Ser 


Ala 


340 






Ala 


Ala 


Thr 


355 






Arg 


Ala 


Leu 


370 






Ser 


Gly 


He 


385 






Pro 


Ser 


Cys 


400 






Pro 


Gly 


Asp 


415 






Pro 


Arg 


Thr 


430 






Lys 


Lys 


Thr 


445 






Pro 


Arg 


Leu 


460 






Ala 


Arg 


Gly 


475 






Met 


Val 


Phe 


490 






Trp 


Leu 


Val 


505 






Ala 


Phe 


Ala 


520 






He 


Ser 


Leu 


535 






Trp 


Thr 


Phe 


550 






Phe 


He 


Tyr 


565 






Glu 


He 


Asp 


580 






Gly 


His 


Arg 


595 






He 


Ser 


Ala 


610 











165 


Arg 


Ala 


Val 






180 


Cys 


He 


Tyr 






195 


Leu 


Val 


Ser 






210 


Ser 


Tyr 


Ala 






225 


Arg 


His 


Met 






240 


Leu 


Ser 


Leu 






255 


His 


Lys 


Asp 






270 


Gly 


Pro 


Gly 






285 


Arg 


Asp 


Asn 






300 


Leu 


Phe 


Gin 






315 


Ser 


Thr 


He 






330 


Val 


Leu 


Ala 






345 


Leu 


Thr 


Ala 






360 


Leu 


Leu 


Ala 






375 


Gly 


Leu 


Val 






390 


Leu 


Ala 


Val 






405 


Ser 


Gly 


Leu 






420 


Asn 


Glu 


Asp 






435 


Lys 


Pro 


His 






450 


Ala 


Leu 


Ser 






465 


His 


Ala 


Leu 






480 


Val 


Ser 


Ala 






495 


Leu 


Ser 


Glu 






510 


Phe 


Cys 


Asn 






525 


Ser 


Phe 


Leu 






540 


Leu 


Leu 


Tyr 






555 


Leu 


Phe 


Val 






570 


Gin 


Gin 


Phe 






585 


Gin 


Asn 


Ser 






600 


Ala 


Ser 
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Met 


Arg 


Phe 


Arg 


Lys 


1 








5 


Val 


Phe 


Ser 


Pro 


Thr 
20 


Pro 


Ser 


Leu 


Thr 


Met 
35 


Trp 


Lys 


Asn 


Phe 


Met 
50 


Val 


Glu 


Leu 


Leu 


Trp 
65 


Val 


Arg 


His 


Ser 


His 
80 


Pro 


Asn 


Lys 


Pro 


Leu 
95 


Gly 


Leu 


He 


Cys 


Asn 
110 


Pro 


Gly Glu 


Glu 


Pro 










125 


Val 


Ser 


Arg 


Leu 


Leu 
140 


Ser 


Ala 


His 


Arg 


Thr 
155 


Leu 


Arg 


Ala 


Ala 


Arg 
170 


Ser 


Pro 


Leu 


Glu 


Pro 
185 


Ser 


Leu 


Leu 


Arg 


Thr 
200 


Gin 


Glu 


Pro 


Leu 


His 
215 


Gin 


Glu 


Leu 


Leu 


Ala 
230 


Leu 


Gin 


Arg 


Pro 


Arg 
245 


Glu 


Ala 


Leu 


Cys 


Ser 
260 


Ser 


Leu 


Asn 


Trp 


Tyr 
275 


Gin 


Glu 


Pro 


Glu 


Ser 
290 


Cys 


Ser 


Glu 


Leu 


He 
305 


Leu 


Leu 


Trp 


Arg 


Arg 
320 


Phe 


Ala 


Pro 


Asp 


Thr 
335 


Asn 


Arg 


Thr 


Phe 


Glu 
350 


Val 


Trp 


Glu 


Met 


Leu 
365 


Ser 


Ser 


Asn 


Val 


Ala 
380 


Glu 


Gly Arg 


Arg 


Gin 










395 


Ala 


Leu Arg 


Ser 


Phe 










410 


Gin 


Asp Ala 


His 


Ala 










425 


Arg 


Val 


Thr 


Glu 


Cys 
440 


Ser 


Glu 


Ala 


Ala 


Leu 
455 
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Gly Gin 


Glu 


Leu 


Pro 










10 


Val 


Val 


Leu 


Thr 


Ser 










25 


Ala 


Phe 


Trp 


Thr 


Gin 










40 


Tyr 


Arg 


Arg Arg 


Gin 










55 


Pro 


Leu 


Phe 


Leu 


Phe 










70 


Pro 


Pro 


Leu 


Glu 


His 










85 


Pro 


Ser 


Ala Gly 


Thr 










100 


Val 


Asn 


Asn 


Thr 


Cys 










115 


Gly Arg 


Leu 


Ser 


Asn 










130 


Ala Asp 


Ala 


Arg 


Thr 










145 


Leu 


Ala 


Gly 


Leu 


Gly 










160 


Ser 


Thr 


Ala 


Gin 


Pro 










175 


Pro 


Met 


Leu Asp 


Val 










190 


GlU 


Ser 


Leu Gly 


Leu 










205 


Ser 


Leu 


Leu 


Glu 


Ala 










220 


Leu Arg 


Ser 


Leu 


Val 










235 


Gly 


Thr 


Ser 


Gly 


Pro 










250 


Val 


Arg 


Gly 


Pro 


Ser 










265 


Glu 


Ala 


Ser 


Asp 


Leu 










280 


Ala 


Leu 


Pro 


Asp 


Ser 










295 


Gly Ala 


Leu 


Asp 


Ser 










310 


Leu 


Lys 


Pro 


Leu 


He 










325 


Pro 


Phe 


Thr 


Arg 


Lys 










340 


Glu 


Leu 


Thr 


Leu 


Leu 










355 


Gly 


Pro 


Arg 


He 


Phe 










370 


Met 


Leu 


Gin 


Arg 


Leu 










385 


Pro 


Arg 


Pro 


Gly 


Gly 










400 


Leu 


Asp 


Pro 


Gly 


Ser 










415 


Asp 


Val 


Gly His 


Leu 










43 0 


Leu 


Ser 


Leu Asp 


Lys 










445 


Val 


Ser 


Arg 


Ala 


Leu 










460 
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Ala 


Ala 


Ala 


Pro 


His 
15 


Leu 


Ser 


Arg 


Pro 


Leu 
30 


Leu 


Met 


Leu 


Leu 


Leu 
45 


Pro 


Val 


Gin 


Leu 


Leu 
60 


Phe 


He 


Leu 


Val 


Ala 
75 


His 


Glu 


Cys 


His 


Phe 
90 


Val 


Pro 


Trp 


Leu 


Gin 
105 


Phe 


Pro 


Gin 


Leu 


Thr 
120 


Phe 


Asn 


Asp 


Ser 


Leu 
135 


Val 


Leu Gly Gly 


Ala 










150 


Lys 


Leu 


He 


Ala 


Thr 
165 


Gin 


Pro 


Thr 


Lys 


Gin 
180 


Ala 


Glu 


Leu 


Leu 


Thr 
195 


Ala 


Leu 


Gly Gin 


Ala 










210 


Ala 


Glu 


Asp 


Leu 


Ala 
225 


Glu 


Leu 


Arg 


Ala 


Leu 

•240 


Leu 


Glu 


Leu 


Leu 


Ser 
255 


Ser 


Thr 


Val 


Gly 


Pro 
270 


Met 


GlU 


Leu 


Val 


Gly 
285 


Ser 


Leu 


Ser 


Pro 


Ala 
300 


His 


Pro 


Leu 


Ser 


Arg 
315 


Leu Gly 


Lys 


Leu 


Leu 










330 


Leu 


Met 


Ala 


Gin 


Val 
345 


Arg Asp Val 


Arg 


Glu 










360 


Thr 


Phe 


Met 


Asn 


Asp 
375 


Leu 


Gin 


Met 


Gin 


Asp 
390 


Arg Asp 


His 


Met 


Glu 










405 


Gly Gly Tyr 


Ser 


Trp 










420 


Val 


Gly 


Thr 


Leu 


Gly 
435 


Leu 


Glu 


Ala 


Ala 


Pro 
450 


Gin 


Leu 


Leu 


Ala 


Glu 
465 
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His 


Arg 


Phe 


Trp 


Ala 


Gly 


Val 


Val 


Phe 


Leu 


Gly 


Pro 


Glu 


Asp 


Ser 






470 










475 










480 


Ser 


Asp 


Pro 


Thr 


Glu 
485 


His 


Pro 


Thr 


Pro 


Asp 
490 


Leu 


Gly 


Pro 


Gly 


His 
495 


Val 


Arg 


He 


Lys 


He 
500 


Arg 


Met 


Asp 


He 


Asp 
505 


Val 


Val 


Thr 


Arg 


Thr 
510 


Asn 


Lys 


He 


Arg 


Asp 
515 


Arg 


Phe 


Trp 


Asp 


Pro 
520 


Gly 


Pro 


Ala 


Ala 


Asp 
525 


Pro 


Leu 


Thr 


Asp 


Leu 
53 0 


Arg 


Tyr 


Val 


Trp 


Gly 
535 


Gly 


Phe 


Val 


Tyr 


Leu 
540 


Gin 


Asp 


Leu 


Val 


Glu 
545 


Arg 


Ala 


Ala 


Val 


Arg 
550 


Val 


Leu 


Ser 


Gly 


Ala 
555 


Asn 


Pro 


Arg 


Ala 


Gly 
560 


Leu 


Tyr 


Leu 


Gin 


Gin 
565 


Met 


Pro 


Tyr 


Pro 


Cys 
570 


Tyr 


Val 


Asp 


Asp 


Val 
575 


Phe 


Leu 


Arg 


Val 


Leu 
580 


Ser 


Arg 


Ser 


Leu 


Pro 
585 


Leu 


Phe 


Leu 


Thr 


Leu 
590 


Ala 


Trp 


He 


Tyr 


Ser 
595 


Val 


Thr 


Leu 


Thr 


Val 
600 


Lys 


Ala 


Val 


Val 


Arg 
605 


Glu 


Lys 


Glu 


Thr 


Arg 
610 


Leu 


Arg 


Asp 


Thr 


Met 
615 


Arg 


Ala 


Met 


Gly 


Leu 
620 


Ser 


Arg 


Ala 


Val 


Leu 
625 


Trp 


Leu 


Gly 


Trp 


Phe 
630 


Leu 


Ser 


Cys 


Leu 


Gly 
635 


Pro 


Phe 


Leu 


Leu 


Ser 
640 


Ala 


Ala 


Leu 


Leu 


Val 
645 


Leu 


Val 


Leu 


Lys 


Leu 
650 


Gly 


Asp 


He 


Leu 


Pro 
655 


Tyr 


Ser 


His 


Pro 


Gly 
660 


Val 


Val 


Phe 


Leu 


Phe 
665 


Leu 


Ala 


Ala 


Phe 


Ala 
670 


Val 


Ala 


Thr 


Val 


Thr 
675 


Gin 


Ser 


Phe 


Leu 


Leu 
680 


Ser 


Ala 


Phe 


Phe 


Ser 
685 


Arg 


Ala 


Asn 


Leu 


Ala 
690 


Ala 


Ala 


Cys 


Gly 


Gly 
695 


Leu 


Ala 


Tyr 


Phe 


Ser 
700 


Leu 


Tyr 


Leu 


Pro 


Tyr 
705 


Val 


Leu 


Cys 


Val 


Ala 
710 


Trp 


Arg 


Asp 


Arg 


Leu 
715 


Pro 


Ala 


Gly 


Gly 


Arg 
720 


Val 


Ala 


Ala 


Ser 


Leu 
725 


Leu 


Ser 


Pro 


Val 


Ala 
730 


Phe 


Gly 


Phe 


Gly 


Cys 
735 


Glu 


Ser 


Leu 


Ala 


Leu 
740 


Leu 


Glu 


Glu 


Gin 


Gly 
745 


Glu 


Gly 


Ala 


Gin 


Trp 
750 


His 


Asn 


Val 


Gly 


Thr 


Arg 


Pro 


Thr 


Ala 


Asp 


Val 


Phe 


Ser 


Leu 


Ala 








755 










760 










765 


Gin 


Val 


Ser 


Gly 


Leu 
770 


Leu 


Leu 


Leu 


Asp 


Ala 
775 


Ala 


Leu 


Tyr 


Gly 


Leu 
780 


Ala 


Thr 


Trp 


Tyr 


Leu 
785 


Glu 


Ala 


Val 


Cys 


Pro 
790 


Gly 


Gin 


Tyr 


Gly 


He 
795 


Pro 


Glu 


Pro 


Trp 


Asn 
800 


Phe 


Pro 


Phe 


Arg 


Arg 
805 


Ser 


Tyr 


Trp 


Cys 


Gly 
810 


Pro 


Arg 


Pro 


Pro 


Lys 
815 


Ser 


Pro 


Ala 


Pro 


Cys 
820 


Pro 


Thr 


Pro 


Leu 


Asp 
825 


Pro 


Lys 


Val 


Leu 


Val 
830 


Glu 


Glu 


Ala 


Pro 


Pro 
835 


Gly 


Leu 


Ser 


Pro 


Gly 
840 


Val 


Ser 


Val 


Arg 


Ser 
845 


Leu 


Glu 


Lys 


Arg 


Phe 
850 


Pro 


Gly 


Ser 


Pro 


Gin 
855 


Pro 


Ala 


Leu 


Arg 


Gly 
860 


Leu 


Ser 


Leu 


Asp 


Phe 
865 


Tyr 


Gin 


Gly 


His 


He 
870 


Thr 


Ala 


Phe 


Leu 


Gly 
875 


His 


Asn 


Gly 


Ala 


Gly 
880 


Lys 


Thr 


Thr 


Thr 


Leu 
885 


Ser 


He 


Leu 


Ser 


Gly 
890 


Leu 


Phe 


Pro 


Pro 


Ser 
895 


Gly 


Gly 


Ser 


Ala 


Phe 
900 


He 


Leu 


Gly 


His 


Asp 
905 


Val 


Arg 


Ser 


Ser 


Met 
910 


Ala 


Ala 


He 


Arg 


Pro 
915 


His 


Leu 


Gly 


Val 


Cys 
920 


Pro 


Gin 


Tyr 


Asn 


Val 
925 


Leu 


Phe 


Asp 


Met 


Leu 
930 


Thr 


Val 


Asp 


Glu 


His 


Val 


Trp 


Phe 


Tyr 


Gly Arg 


Leu 


Lys 


Gly 


Leu 










935 










940 










945 


Ser 


Ala 


Ala 


Val 


Val 


Gly 


Pro 


Glu 


Gin 


Asp 


Arg 


Leu 


Leu 


Gin 


Asp 










950 








955 










960 


Val 


Gly 


Leu 


Val 


Ser 


Lys 


Gin 


Ser 


Val 


Gin 


Thr 


Arg 


His 


Leu 


Ser 
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965 


Gly 


Gly 


Met 


Gin Arg 








980 


Gly 


Ser 


Gin 


Val Val 








995 


Pro 


Ala 


Ser 


Arg Arg 








1010 


Glu 


Gly 


Arg 


Thr Leu 








1025 


Glu 


Leu 


Leu Gly Asp 








1040 


Cys 


Cys 


Cys 


Gly Ser 








1055 


Gly 


Tyr 


Tyr 


Leu Thr 








1070 


Asn 


Glu 


Lys 


Ala Asp 








1085 


Gin 


Glu 


Lys 


Lys Asn 








1100 


Gin 


Leu 


Leu 


Ala Leu 








1115 


Val 


Glu 


Glu 


Leu Pro 








1130 


Gly 


Ala 


His 


Asp Gly 








1145 


Thr 


Arg 


Leu 


Ala Glu 








1160 


Thr 


Ser 


Leu 


Glu Glu 








1175 


Ala 


Asp 


Thr 


Asp Met 








1190 


Thr 


Gly 


He 


Ala Gly 








1205 


Pro 


Gin 


Glu 


Thr Ala 








1220 


Pro 


Glu 


Thr 


Asp Gin 








1235 


Gin 


Gly 


Trp 


Ala Leu 








1250 


Lys 


Arg 


Phe 


Leu Leu 








1265 


Gin 


He 


Val 


Leu Pro 








1280 


Ser 


Leu 


He 


Val Pro 








1295 


Ser 


Pro 


Thr 


Met Tyr 








1310 


Ala 


Pro 


Gly Asp Pro 








1325 


Gin 


Glu 


Ala 


Gly Leu 








1340 


Arg 


Phe 


Ser 


Ala Pro 








1355 


Ala 


Ser 


Gly 


Asn Trp 






1370 


Cys 


Ser 


Arg 


Pro Gly 








1385 


Ala 


Ala 


Gly Gly Pro 








1400 


Glu 


Val 


Val 


Gin Asn 








1415 


Val 


Lys 


Thr 


Tyr Pro 








1430 


Lys 


Trp 


Val 


Asn Glu 








1445 


Arg 


Asp 


Pro 


Gly Leu 








1460 









970 


Lys 


Leu 


Ser 


Val Ala 








985 


He 


Leu 


Asp 


Glu Pro 








1000 


Gly 


He 


Trp 


Glu Leu 








1015 


He 


Leu 


Ser 


Thr His 








1030 


Arg 


Val 


Ala 


Val Val 








1045 


Pro 


Leu 


Phe 


Leu Arg 








1060 


Leu 


Val 


Lys 


Ala Arg 








1075 


Thr 


Asp 


Met 


Glu Gly 








1090 


Gly 


Ser 


Gin 


Gly Ser 








1105 


Val 


Gin 


His 


Trp Val 








1120 


His 


Glu 


Leu 


Val Leu 








1135 


Ser 


Phe 


Ala 


Thr Leu 








1150 


Leu 


Arg 


Leu 


Thr Gly 








1165 


He 


Phe 


Leu 


Lys Val 








1180 


Glu 


Asp 


Gly 


Ser Cys 








1195 


Leu 


Asp 


Val 


Thr Leu 








1210 


Leu 


Glu 


Asn 


Gly Glu 








1225 


Gly 


Ser 


Gly 


Pro Asp 








1240 


Thr 


Arg 


Gin 


Gin Leu 








1255 


Ala 


Arg 


Arg 


Ser Arg 








1270 


Ala 


Leu 


Phe 


Val Gly 








1285 


Pro 


Phe 


Gly 


His Tyr 








1300 


Gly 


Ala 


Gin 


Val Ser 








1315 


Gly 


Arg 


Ala 


Arg Leu 








1330 


Glu 


Glu 


Pro 


Pro Val 








1345 


Glu 


Val 


Pro 


Ala Glu 








1360 


Thr 


Pro 


Glu 


Ser Pro 








1375 


Ala 


Arg 


Arg 


Leu Leu 








1390 


Pro 


Pro 


Pro 


Gin Ala 








1405 


Gin 


Thr 


Gly 


Arg Asn 








1420 


Arg 


Leu 


Val 


Arg Gin 








1435 


Val 


Arg 


Tyr 


Gly Gly 








1450 


Pro 


Ser 


Gly 


Gin Glu 








1465 



975 



He 


Ala 


Phe 


Val Gly 
990 


Thr 


Ala 


Gly 


Val Asp 
1005 


Leu 


Leu 


Lys 


Tyr Arg 
1020 


His 


Leu 


Asp 


Glu Ala 
1035 


Ala 


Gly 


Gly 


Arg Leu 
1050 


Arg 


His 


Leu 


Gly Ser 
1065 


Leu 


Pro 


Leu 


Thr Thr 
1080 


Ser 


Val 


Asp 


Thr Arg 
1095 


Arg 


Val 


Gly 


Thr Pro 
1110 


Pro 


Gly 


Ala 


Arg Leu 
1125 


Val 


Leu 


Pro 


Tyr Thr 
1140 


Phe 


Arg 


Glu 


Leu Asp 
1155 


Tyr 


Gly 


He 


Ser Asp 
1170 


Val 


Glu 


Glu 


Cys Ala 
1185 


Gly 


Gin 


His 


Leu Cys 
1200 


Arg 


Leu 


Lys 


Met Pro 
1215 


Pro 


Ala 


Gly 


Ser Ala 
1230 


Ala 


Val 


Gly 


Arg Val 
1245 


Gin 


Ala 


Leu 


Leu Leu 
1260 


Arg 


Gly 


Leu 


Phe Ala 
1275 


Leu 


Ala 


Leu 


Val Phe 
1290 


Pro 


Ala 


Leu 


Arg Leu 
1305 


Phe 


Phe 


Ser 


Glu Asp 
1320 


Leu 


Glu 


Ala 


Leu Leu 
1335 


Gin 


His 


Ser 


Ser His 
1350 


Val 


Ala 


Lys 


Val Leu 
1365 


Ser 


Pro 


Ala 


Cys Gin 
1380 


Pro 


Asp 


Cys 


Pro Ala 
1395 


Val 


Thr 


Gly 


Ser Gly 
1410 


Leu 


Ser 


Asp 


Phe Leu 
1425 


Gly 


Leu 


Lys 


Thr Lys 
1440 


Phe 


Ser 


Leu 


Gly Gly 
1455 


Leu 


Gly 


Arg 


Ser Val 



1470 
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Glu 


Glu 


Leu 


Trp Ala 
1475 


Leu 


Leu 


Ser 


Pro Leu 
1480 


Pro 


Gly 


Gly 


Ala Leu 
1485 


Asp 


Arg 


Val 


Leu Lys 
1490 


Asn 


Leu 


Thr 


Ala Trp 
1495 


Ala 


His 


Ser 


Leu Asp 
1500 


Ala 


Gin 


Asp 


Ser Leu 
1505 


Lys 


He 


Trp 


Phe Asn 
1510 


Asn 


Lys 


Gly 


Trp His 
1515 


Ser 


Met 


Val 


Ala Phe 
1520 


Val 


Asn 


Arg 


Ala Ser 
1525 


Asn 


Ala 


He 


Leu Arg 
1530 


Ala 


His 


Leu 


Pro Pro 


Gly 


Pro 


Ala 


Arg His 


Ala 


His 


Ser 


He Thr 








1535 






1540 








1545 


Thr 


Leu 


Asn 


His Pro 
1550 


Leu 


Asn 


Leu 


Thr Lys 
1555 


Glu 


Gin 


Leu 


Ser Glu 
1560 


Ala 


Ala 


Leu 


Met Ala 
1565 


Ser 


Ser 


Val 


Asp Val 
1570 


Leu 


Val 


Ser 


He Cys 
1575 


Val 


Val 


Phe 


Ala Met 
1580 


Ser 


Phe 


Val 


Pro Ala 
1585 


Ser 


Phe 


Thr 


Leu Val 
1590 


Leu 


He 


Glu 


Glu Arg 
1595 


Val 


Thr 


Arg 


Ala Lys 
1600 


His 


Leu 


Gin 


Leu Met 
1605 


Gly 


Gly 


Leu 


Ser Pro 
1610 


Thr 


Leu 


Tyr 


Trp Leu 
1615 


Gly 


Asn 


Phe 


Leu Trp 
1620 


Asp 


Met 


Cys 


Asn Tyr 
1625 


Leu 


Val 


Pro 


Ala Cys 
1630 


He 


Val 


Val 


Leu He 
1635 


Phe 


Leu 


Ala 


Phe Gin 
1640 


Gin 


Arg 


Ala 


Tyr Val 
1645 


Ala 


Pro 


Ala 


Asn Leu 
1650 


Pro 


Ala 


Leu 


Leu Leu 
1655 


Leu 


Leu 


Leu 


Leu Tyr 
1660 


Gly 


Trp 


Ser 


He Thr 
1665 


Pro 


Leu 


Met 


Tyr Pro 
1670 


Ala 


Ser 


Phe 


Phe Phe 
1675 


Ser 


Val 


Pro 


Ser Thr 
1680 


Ala 


Tyr 


Val 


Val Leu 


Thr 


Cys 


He 


Asn Leu 


Phe 


He 


Gly 


He Asn 






1685 








1690 








1695 


Gly 


Ser 


Met 


Ala Thr 


Phe 


Val 


Leu 


Glu Leu 


Phe 


Ser 


Asp 


Gin Lys 






1700 








1705 








1710 


Leu 


Gin 


Glu 


Val Ser 


Arg 


He 


Leu 


Lys Gin 


Val 


Phe 


Leu 


He Phe 








1715 






1720 








1725 


Pro 


His 


Phe 


Cys Leu 
1730 


Gly 


Arg 


Gly 


Leu He 
1735 


Asp 


Met 


Val 


Arg Asn 
1740 


Gin 


Ala 


Met 


Ala Asp 
1745 


Ala 


Phe 


Glu 


Arg Leu 
1750 


Gly 


Asp 


Arg 


Gin Phe 
1755 


Gin 


Ser 


Pro 


Leu Arg 
1760 


Trp 


Glu 


Val 


Val Gly 
1765 


Lys 


Asn 


Leu 


Leu Ala 
1770 


Met 


Val 


He 


Gin Gly 
1775 


Pro 


Leu 


Phe 


Leu Leu 
1780 


Phe 


Thr 


Leu 


Leu Leu 
1785 


Gin 


His 


Arg 


Ser Gin 
1790 


Leu 


Leu 


Pro 


Gin Pro 
1795 


Arg 


Val 


Arg 


Ser Leu 
1800 


Pro 


Leu 


Leu 


Gly Glu 
1805 


Glu 


Asp 


Glu 


Asp Val 
1810 


Ala 


Arg 


Glu 


Arg Glu 
1815 


Arg 


Val 


Val 


Gin Gly 
1820 


Ala 


Thr 


Gin 


Gly Asp 
1825 


Val 


Leu 


Val 


Leu Arg 
1830 


Asn 


Leu 


Thr 


Lys Val 
1835 


Tyr 


Arg 


Gly 


Gin Arg 
1840 


Met 


Pro 


Ala 


Val Asp 
1845 


Arg 


Leu 


Cys 


Leu Gly 
1850 


He 


Pro 


Pro 


Gly Glu 
1855 


Cys 


Phe 


Gly 


Leu Leu 
1860 


Gly 


Val 


Asn 


Gly Ala 
1865 


Gly 


Lys 


Thr 


Ser Thr 
1870 


Phe 


Arg 


Met 


Val Thr 
1875 


Gly 


Asp 


Thr 


Leu Ala 
1880 


Ser 


Arg 


Gly 


Glu Ala 
1885 


Val 


Leu 


Ala 


Gly His 
1890 


Ser 


Val 


Ala 


Arg Glu 
1895 


Pro 


Ser 


Ala 


Ala His 
1900 


Leu 


Ser 


Met 


Gly Tyr 
1905 


Cys 


Pro 


Gin 


Ser Asp 
1910 


Ala 


He 


Phe 


Glu Leu 
1915 


Leu 


Thr 


Gly 


Arg Glu 
1920 


His 


Leu 


Glu 


Leu Leu 
1925 


Ala 


Arg 


Leu 


Arg Gly 
1930 


Val 


Pro 


Glu 


Ala Gin 
1935 


Val 


Ala 


Gin 


Thr Ala 


Gly 


Ser 


Gly 


Leu Ala 


Arg 


Leu 


Gly 


Leu Ser 








1940 




1945 








1950 


Trp 


Tyr 


Ala 


Asp Arg 
1955 


Pro 


Ala 


Gly 


Thr Tyr 
1960 


Ser 


Gly 


Gly 


Asn Lys 
1965 


Arg 


Lys 


Leu 


Ala Thr 


Ala 


Leu 


Ala 


Leu Val 


Gly 


Asp 


Pro 


Ala Val 
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1970 








Val 


Phe 


Leu 


Asp Glu 


Pro 


Thr 


Thr 








1985 








Arg 


Phe 


Leu 


Trp Asn 


Ser 


Leu 


Leu 








2000 








Ser 


Val 


Met 


Leu Thr 


Ser 


His 


Ser 








2015 








Cys 


Ser 


Arg 


Leu Ala 


He 


Met 


Val 








2030 








Gly 


Ser 


Pro 


Gin His 


Leu 


Lys 


Gly 








2045 








Leu 


Thr 


Leu 


Arg Val 


Pro 


Ala 


Ala 








2060 








Phe 


Val 


Ala 


Ala Glu 


Phe 


Pro 


Gly 








2075 








Gly 


Gly 


Arg 


Leu Arg 


Phe 


Gin 


Leu 








2090 








Leu 


Ala 


Arg 


Val Phe 


Gly Glu 


Leu 








2105 








Gly 


Val 


Glu 


Asp Phe 


Ser 


Val 


Ser 








2120 








Phe 


Leu 


Tyr 


Phe Ser 


Lys 


Asp 


Gin 








2135 








Glu 


Gin 


Lys 


Glu Ala 


Gly Val 


Gly 








2150 








Gin 


His 


Pro 


Lys Arg 


Val 


Ser 


Gin 








2165 








Ala 


Glu 


Thr 


Val Leu 














2180 
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<400> 28 



Met 


Leu 


Lys 


Arg 


Lys 


Gin 


Ser 


Ser 


1 








5 








Thr 


Asp 


Phe 


Gly 


Pro 
20 


Asp 


Glu 


Ser 


Leu 


Trp 


He 


Asn 


Lys 

35 


Pro 


Trp 


Val 


Ala 


He 


He 


Ser 


Val 
50 


He 


Ser 


Val 


Phe 


Glu 


His 


Tyr 


Pro 
65 


Pro 


Leu 


Gin 


Thr 


Leu 


Leu 


Met 


Phe 
80 


Leu 


Tyr 


Thr 


His 


He 


Arg 


Gly 


He 
95 


Val 


Lys 


Gly 


Arg 


Trp 


Cys 


Val 


Phe 
110 


Asp 


Gly 


Phe 


Ser 


Leu 


Val 


Leu 


Gin 
125 


Val 


Phe 


Glu 


Met 


Ser 


Pro 


Trp 


Gly 
140 


Met 


Leu 


Arg 


He 


Arg 


Ala 


Phe 


Arg 
155 


He 


Tyr 


Phe 


Arg 


He 


Thr 


Asn 


He 
170 


Leu 


Lys 


Arg 


Val 


Ser 


He 


Phe 


Leu 

185 


Leu 


Phe 


Phe 


Gly 


Val 


Gin 


Met 


Phe 


Gly 


Thr 


Phe 



1975 








1980 


Gly 


Met 


Asp 


Pro 


Ser 


Ala 


Arg 


1990 








1995 


Ala 


Val 


Val 


Arg 


Glu 


Gly 


Arg 


2005 








2010 


Met 


Glu 


Glu 


Cys 


Glu 


Ala 


Leu 


2020 








2025 


Asn 


Gly 


Arg 


Phe 


Arg 


Cys 


Leu 


2035 








2040 


Arg 


Phe 


Ala 


Ala 


Gly 


His 


Thr 


2050 








2055 


Arg 


Ser 


Gin 


Pro 


Ala 


Ala 


Ala 


2065 








2 07 0 


Ala 


Glu 


Leu 


Arg 


Glu 


Ala 


His 


2080 








2085 


Pro 


Pro 


Gly 


Gly 


Arg 


Cys 


Ala 


2095 








2100 


Ala 


Val 


His 


Gly 


Ala 


Glu 


His 


2110 








2115 


Gin 


Thr 


Met 


Leu 


Glu 


Glu 


Val 


2125 








2130 


Gly 


Lys 


Asp 


Glu 


Asp 


Thr 


GlU 


2140 








2145 


Val 


Asp 


Pro 


Ala 


Pro 


Gly 


Leu 


2155 








2160 


Phe 


Leu 


Asp 


Asp 


Pro 


Ser 


Thr 


2170 








2175 


Arg 


Val 


Glu 


Ala 


Gin 


Pro 


Val 




10 










15 


Leu 


Ser 


Asp 


Asn 


Ala 


Asp 


He 




25 










30 


His 


Ser 


Leu 


Leu 


Arg 


He 


Cys 




40 










45 


Cys 


Met 


Asn 


Thr 


Pro 


Met 


Thr 




55 










60 


Tyr 


Val 


Thr 


Phe 


Thr 


Leu 


Asp 




70 










75 


Ala 


Glu 


Met 


He 


Ala 


Lys 


Met 




85 








90 


Asp 


Ser 


Ser 


Tyr 


Val 


Lys 


Asp 




100 










105 


Met 


Val 


Phe 


Cys 


Leu 


Trp 


Val 




115 










120 


He 


Ala 


Asp 


He 


Val 


Asp 


Gin 




130 










135 


He 


Pro 


Arg 


Pro 


Leu 


He 


Met 




145 










150 


Arg 


Phe 


Glu 


Leu 


Pro 


Arg 


Thr 




160 










165 


ser 


Gly 


Glu 


Gin 


He 


Trp 


Ser 




175 










180 


Leu 


Leu 


Leu 


Tyr 


Gly 


He 


Leu 




190 










195 


Thr 


Tyr 


His 


Cys 


Val 


Val 


Asn 



49/87 



BNSDOCID: <WO 0204S20A2_I_> 



WO 02/04520 



PCT/US01/21448 



Asp 


Thr 


Lys 


Pro 


Gly 


Asn 


Val 










215 






Asp 


Thr 


His 


Cys 


Ser 


Pro 


Glu 










230 






Pro 


Gly 


Phe 


Lys 


Cys 


Met 


Asp 










245 






Gin 


Glu 


Leu 


Gly 


Tyr 


Ser 


Gly 










260 






Phe 


Thr 


Val 


Tyr 


Glu 


Ala 


Ala 








275 






Met 


Tyr 


Arg 


Ala 


He 


Asp 


Ser 










290 






Tyr 


Phe 


He 


Thr 


Leu 


He 


Phe 








305 






Val 


Phe 


He 


Ala 


Val 


He 


He 










320 






Gin 


Phe 


Gin 


Gin 


Met 


Trp 


Gly 










335 






Ala 


Thr 


Thr 


Gin 


Met 


Phe 


His 










350 






Leu 


Val 


Ala 


Val 


Asp 


Val 


Asn 










365 






Cys 


Leu 


Gin 


Lys 


Met 


Met 


Arg 










380 






Leu 


Ser 


Met 


Val 


Thr 


Val 


Asp 










395 






Tyr 


Lys 


Gly 


Glu 


Asn 


Phe 


Arg 










410 






Ala 


Glu 


Val 


Ala 


Phe 


Thr 


Val 










425 






Lys 


lie 


Trp 


Cys 


Leu 


Gly 


Phe 










440 






His 


Lys 


Phe 


Glu 


Leu 


Leu 


Leu 








455 






Tyr 


Pro 


Asp 


Leu 


Tyr 


His 


Ser 










470 






Arg 


Val 


Val 


Arg 


Leu 


He 


Lys 










485 






Val 


Tyr 


Lys 


He 


Phe 


Gly 


Pro 










500 






Val 


Phe 


Thr 


Ala 


Ser 


Leu 


Leu 










515 






Gin 


Met 


Phe 


Cys 


Phe 


Val 


Glu 










530 






Pro 


Arg 


Ala 


Phe 


Met 


Ser 


Met 








545 






Trp 


Val 


Asp 


Val 


Met 


Asp 


Gin 










560 






Trp 


Ala 


Pro 


Val 


Val 


Ala 


He 








575 






Ala 


Thr 


Leu 


He 


Leu 


Leu 


Ser 










590 






Asn 


Leu 


Glu 


Leu 


Asp 


Glu 


Asp 










605 






Gin 


Ser 


Glu 


Ala 


Asn 


Ala Asp 










620 






Leu 


Arg 


He 


Phe 


Glu 


Lys 


Phe 










635 






lie 


Ser 


Lys 


Leu 


Pro 


Ser 


Asp 










650 






Ser 


Phe 


Met 


Lys 


Gin 


Phe 


He 








665 






Cys 


Leu 


Leu 


Arg 


Ser 


Leu 


Pro 










680 






His 


Ser 


Lys 


Arg 


Ser 


Ala 


He 



695 







205 










210 


Thr 


Trp 


Asn 


Ser 


Leu 


Ala 


He 


Pro 




220 










225 


Leu 


Glu 


Glu 


Gly 


Tyr 


Gin 


Cys 


Pro 






235 










240 


Leu 


Glu 


Asp 


Leu 


Gly 


Leu 


Ser 


Arg 






250 










255 


Phe 


Asn 


Glu 


He 


Gly 


Thr 


Ser 


He 






265 










270 


Ser 


Gin 


Glu 


Gly 


Trp 


Val 


Phe 


Leu 






280 










285 


Phe 


Pro 


Arg 


Trp 


Arg 


Ser 


Tyr 


Phe 






295 










300 


Phe 


Leu 


Ala 


Trp 


Leu 


Val 


Lys 


Asn 






310 










315 


Glu 


Thr 


Phe 


Ala 


Glu 


He Arg 


Val 






325 










330 


Ser 


Arg 


Ser 


Ser 


Thr 


Thr 


Ser 


Thr 






340 










345 


Glu 


Asp 


Ala 


Ala Gly Gly Trp 


Gin 






355 










360 


Lys 


Pro 


Gin 


Gly 


Arg 


Ala 


Pro 


Ala 






370 










375 


Ser 


Ser 


Val 


Phe 


His 


Met 


Phe 


He 






385 










390 


Val 


He 


Val 


Ala 


Ala 


Ser 


Asn 


Tyr 






400 










405 


Arg 


Gin 


Tyr 


Asp 


Glu 


Phe 


Tyr 


Leu 






415 










420 


Leu 


Phe 


Asp 


Leu 


Glu 


Ala 


Leu 


Leu 






430 










435 


Thr 


Gly 


Tyr 


He 


Ser 


Ser 


Ser 


Leu 




445 










450 


Val 


He 


Gly 


Thr 


Thr 


Leu 


His 


Val 






460 










465 


Gin 


Phe 


Thr 


Tyr 


Phe 


Gin 


Val 


Leu 






475 










480 


He 


Ser 


Pro 


Ala 


Leu 


Glu 


Asp 


Phe 






490 










495 


Gly 


Lys 


Lys 


Leu 


Gly 


Ser 


Leu 


Val 






505 










510 


He 


Val 


Met 


Ser 


Ala 


He 


Ser 


Leu 






520 










525 


Glu 


Leu 


Asp 


Arg 


Phe 


Thr 


Thr 


Phe 






535 










540 


Phe 


Gin 


He 


Leu 


Thr 


Gin 


Glu 


Gly 






550 










555 


Thr 


Leu 


Asn 


Ala 


Val 


Gly 


His 


Met 






565 








570 


Tyr 


Phe 


He 


Leu Tyr 


His 


Leu 


Phe 




580 










585 


Leu 


Phe 


Val 


Ala 


Val 


He 


Leu 


Asp 






595 










600 


Leu 


Lys 


Lys 


Leu 


Lys 


Gin 


Leu 


Lys 






610 










615 


Thr 


Lys 


Glu 


Lys 


Leu 


Pro 


Leu 


Arg 






625 










63 0 


Pro 


Asn 


Arg 


Pro 


Gin 


Met 


Val 


Lys 






640 










645 


Phe 


Thr 


Val 


Pro 


Lys 


He 


Arg 


Glu 






655 










660 


Asp 


Arg 


Gin 


Gin Gin Asp 


Thr 


Cys 






670 










675 


Thr 


Thr 


Ser 


Ser 


Ser 


Ser 


Cys 


Asp 






685 










690 


Glu 


Asp 


Asn 


Lys 


Tyr 


He 


Asp 


Gin 






700 










705 
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Lys 


Leu 


Arg 


Lys 


Ser 


Val 


Phe 


Ser 


He Arg 


Ala 


Arg 


Asn 


Leu Leu 










710 








715 








720 


Glu 


Lys 


Glu 


Thr 


Ala 


Val 


Thr 


Lys 


He Leu 


Arg 


Ala 


Cys 


Thr Arg 










725 








730 








735 


Gin 


Arg 


Met 


Leu 


Ser 


Gly 


Ser 


Phe 


Glu Gly 


Gin 


Pro 


Ala 


Lys Glu 








740 








745 








750 


Arg 


Ser 


He 


Leu 


Ser 


Val 


Gin 


His 


His He 


Arg 


Gin 


Glu 


Arg Arg 










755 








760 








765 


Ser 


Leu 


Arg 


His 


Gly 


Ser 


Asn 


Ser 


Gin Arg 


He 


Ser 


Arg 


Gly Lys 










770 








775 








780 


Ser 


Leu 


Glu 


Thr 


Leu 


Thr 


Gin 


Asp 


His Cys 


Asn 


Thr 


Val 


He Tyr 










785 








790 








795 


Arg 


Asn 


Ala 


Gin 


Arg 


Glu 


Val 


Ser 


Glu He 


Lys 


Met 


He 


Gin Glu 








800 








805 








810 


Lys 


Lys 


Glu 


Leu 


Ala 


Glu 


Met 


Leu 


Gin Gly 


Lys 


Cys 


Lys 


Lys Glu 










815 








820 








825 


Leu 


Arg 


Glu 


Ser 


His 


Pro 


Tyr 


Phe 


Asp Lys 


Pro 


Leu 


Phe 


He Val 










830 








835 








840 


Gly 


Arg 


Glu 


His 


Arg 


Phe 


Arg 


Asn 


Phe Cys 


Arg 


Val 


Val 


Val Arg 










845 








850 








855 


Ala 


Arg 


Phe 


Asn 


Ala 


Ser 


Lys 


Thr 


Asp Pro 


Val 


Thr 


Gly 


Ala Val 










860 








865 








870 


Lys 


Asn 


Thr 


Lys 


Tyr 


His 


Leu 


Leu 


Tyr Asp 


Leu 


Leu 


Gly 


Leu Val 








875 








880 








885 


Thr 


Tyr 


Leu 


Asp 


Trp 


Val 


Met 


He 


He Val 


Thr 


Ser 


Asp 


Ser Cys 










890 








895 








900 


He 


Ser 


Met 


Met 


Phe 


Glu 


Ser 


Pro 


Phe Arg 


Arg 


Val 


Met 


His Ala 










905 








910 








915 


Pro 


Thr 


Leu 


Gin 


He 


Ala 


Glu 


Tyr 


Val Phe 


Val 


He 


Phe 


Met Ser 










920 






925 








930 


He 


Glu 


Leu 


Asn 


Leu 


Lys 


He 


Met 


Ala Asp 


Gly 


Leu 


Phe 


Phe Thr 










935 






940 








945 


Pro 


Thr 


Ala 


Val 


He 


Arg 


Asp 


Phe 


Gly Gly 


Val 


Met 


Asp 


He Phe 










950 








955 








960 


He 


Tyr 


Leu 


Val 


Ser 


Leu 


He 


Phe 


Leu Cys 


Trp 


Met 


Pro 


Gin Asn 










965 








970 








975 


Val 


Pro 


Ala 


Glu 


Ser 


Gly 


Ala 


Gin 


Leu Leu 


Met 


Val 


Leu 


Arg Cys 










980 






985 








990 


Leu 


Arg 


Pro 


Leu 


Arg 


He 


Phe 


Lys 


Leu Val 


Pro 


Gin 


Met 


Arg Lys 










995 








1000 








1005 


Val 


Val 


Arg 


Glu 


Leu 


Phe 


Ser 


Gly 


Phe Lys 


Glu 


He 


Phe 


Leu Val 










1010 








1015 








1020 


Ser 


He 


Leu 


Leu 


Leu 


Thr 


Leu 


Met 


Leu Val 


Phe 


Ala 


Ser 


Phe Gly 










1025 








1030 








1035 


Val 


Gin 


Leu 


Phe 


Ala 


Gly 


Lys 


Leu 


Ala Lys 


Cys 


Asn 


Asp 


Pro Asn 










1040 








1045 








1050 


He 


He 


Arg 


Arg 


Glu 


Asp 


Cys 


Asn 


Gly He 


Phe 


Arg 


He 


Asn Val 










1055 








1060 








1065 


Ser 


Val 


Ser 


Lys 


Asn 


Leu 


Asn 


Leu 


Lys Leu 


Arg 


Pro 


Gly 


Glu Lys 










1070 








1075 








1080 


Lys 


Pro 


Gly 


Phe 


Trp 


Val 


Pro 


Arg 


Val Trp 


Ala 


Asn 


Pro 


Arg Asn 










1085 








1090 








1095 


Phe 


Asn 


Phe 


Asp 


Asn 


Val 


Gly 


Asn 


Ala Met 


Leu 


Ala 


Leu 


Phe Glu 










1100 






1105 








1110 


Val 


Leu 


Ser 


Leu 


Lys 


Gly 


Trp 


Val 


Glu Val 


Arg 


Asp 


Val 


He He 










1115 








1120 








1125 


His 


Arg 


Val 


Gly 


Pro 


He 


His 


Gly 


He Tyr 


He 


His 


Val 


Phe Val 










1130 








1135 








1140 


Phe 


Leu 


Gly 


Cys 


Met 


He 


Gly 


Leu 


Thr Leu 


Phe 


Val 


Gly 


Val Val 










1145 








1150 








1155 


He 


Ala 


Asn 


Phe 


Asn 


Glu 


Asn 


Lys 


Gly Thr 


Ala 


Leu 


Leu 


Thr Val 










1160 








1165 








1170 


Asp 


Gin 


Arg 


Arg 


Trp 


Glu 


Asp 


Leu 


Lys Ser 


Arg 


Leu 


Lys 


He Ala 










1175 








1180 








1185 


Gin 


Pro 


Leu 


His 


Leu 


Pro 


Pro 


Arg 


Pro Asp 


Asn 


Asp 


Gly 


Phe Arg 










1190 








1195 








1200 


Ala 


Lys 


Met 


Tyr 


Asp 


He 


Thr 


Gin 


His Pro 


Phe 


Phe 


Lys 


Arg Thr 
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1205 








1210 








1215 


lie 


Ala 


Leu 


Leu Val 


Leu 


Ala 


Gin 


Ser Val 


Leu 


Leu 


Ser 


Val Lys 








1220 








1225 








1230 


Trp 


Asp 


Val 


Glu Asp 


Pro 


Val 


Thr 


Val Pro 


Leu 


Ala 


Thr 


Met Ser 






1235 








1240 








1245 


Val 


Val 


Phe 


Thr Phe 


He 


Phe 


Val 


Leu Glu 


Val 


Thr 


Met 


Lys He 








1250 








1255 








1260 


lie 


Ala 


Met 


Ser Pro 


Ala 


Gly 


Phe 


Trp Gin 


Ser 


Arg 


Arg 


Asn Arg 








1265 








1270 








1275 


Tyr 


Asp 


Leu 


Leu Val 


Thr 


Ser 


Leu 


Gly Val 


Val 


Trp 


Val 


Val Leu 






1280 








1285 








1290 


His 


Phe 


Ala 


Leu Leu 


Asn 


Ala 


Tyr 


Thr Tyr 


Met 


Met 


Gly Ala Cys 








1295 








1300 








1305 


Val 


lie 


Val 


Phe Arg 


Phe 


Phe 


Ser 


He Cys 


Gly 


Lys 


His 


Val Thr 








1310 








1315 








1320 


Leu 


Lys 


Met 


Leu Leu 


Leu 


Thr 


Val 


Val Val 


Ser 


Met 


Tyr 


Lys Ser 






1325 








1330 








1335 


Phe 


Phe 


He 


He Val 


Gly 


Met 


Phe 


Leu Leu 


Leu 


Leu 


Cys 


Tyr Ala 








1340 








1345 








1350 


Phe 


Ala 


Gly Val Val 


Leu 


Phe 


Gly 


Thr Val 


Lys 


Tyr 


Gly Glu Asn 








1355 








1360 








1365 


lie 


Asn 


Arg 


His Ala 


Asn 


Phe 


Ser 


Ser Ala 


Gly 


Lys 


Ala 


He Thr 








.1370 








1375 








1380 


Val 


Leu 


Phe 


Arg He 


Val 


Thr 


Gly 


Glu Asp 


Trp 


Asn 


Lys 


He Met 








1385 








1390 








1395 


His 


Asp 


Cys 


Met Val 


Gin 


Pro 


Pro 


Phe Cys 


Thr 


Pro 


Asp 


Glu Phe 




1400 








1405 








1410 


Thr 


Tyr 


Trp Ala Thr 


Asp 


Cys 


Gly 


Asn Tyr 


Ala 


Gly 


Ala 


Leu Met 








1415 








1420 








1425 


Tyr 


Phe 


Cys 


Ser Phe 


Tyr 


Val 


He 


He Ala 


Tyr 


He 


Met 


Leu Asn 




1430 








1435 








1440 


Leu 


Leu 


Val 


Ala He 


He 


Val 


Glu 


Asn Phe 


Ser 


Leu 


He 


Tyr Ser 








1445 








1450 








1455 


Thr 


GlU 


Glu Asp Gin 


Leu 


Leu 


Ser 


Tyr Asn 


Asp 


Leu 


Arg 


His Phe 








1460 








1465 








1470 


Gin 


He 


He 


Trp Asn 


Met 


Val 


Asp 


Asp Lys 


Arg 


Glu 


Val 


Phe Pro 








1475 








1480 








1485 


Thr 


Phe 


Arg 


Val Lys 


Phe 


Leu 


Leu 


Arg Leu 


Leu 


Arg 


Gly Arg Leu 








1490 








1495 








1500 


Glu 


Val 


Asp 


Leu Asp 


Lys 


Asp 


Lys 


Leu Leu 


Phe 


Lys 


His 


Met Cys 








1505 








1510 








1515 


Tyr 


Glu 


Met 


Glu Arg 


Leu 


His 


Asn 


Gly Gly 


Asp 


Val 


Thr 


Phe His 






1520 








1525 








1530 


Asp 


Val 


Leu 


Ser Met 


Leu 


Ser 


Tyr 


Arg Ser 


Val 


Asp 


He 


Arg Lys 








1535 








1540 








1545 


Ser 


Leu 


Gin 


Leu Glu 


Glu 


Leu 


Leu 


Ala Arg 


Glu 


Gin 


Leu 


Glu Tyr 








1550 








1555 








1560 


Thr 


He 


Glu 


Glu Glu 


Val 


Ala 


Lys 


Gin Thr 


He 


Arg 


Met 


Trp Leu 








1565 






1570 








1575 


Lys 


Lys 


Cys 


Leu Lys 


Arg 


He 


Arg 


Ala Lys 


Gin 


Gin 


Gin 


Ser Cys 








1580 








1585 








1590 


Ser 


He 


He 


His Ser 


Leu 


Arg 


Glu 


Ser Gin 


Gin 


Gin 


Glu 


Leu Ser 








1595 








1600 








1605 


Arg 


Phe 


Leu 


Asn Pro 


Pro 


Ser 


He 


Glu Thr 


Thr 


Gin 


Pro 


Ser Glu 








1610 








1615 








1620 


Asp 


Thr 


Asn 


Ala Asn 


Ser 


Gin 


Asp 


Asn Ser 


Met 


Gin 


Pro 


Glu Thr 






1625 






1630 








1635 


Ser 


Ser 


Gin 


Gin Gin 


Leu 


Leu 


Ser 


Pro Thr 


Leu 


Ser 


Asp Arg Gly 








1640 








1645 








1650 


Gly 


Ser 


Arg 


Gin Asp 


Ala 


Ala 


Asp 


Ala Gly 


Lys 


Pro 


Gin 


Arg Lys 








1655 








1660 








1665 


Phe 


Gly 


Gin 


Trp Arg 


Leu 


Pro 


Ser 


Ala Pro 


Lys 


Pro 


He 


Ser His 






1670 








1675 








1680 


Ser 


Val 


Ser 


Ser Val 


Asn 


Leu 


Arg 


Phe Gly Gly Arg 


Thr 


Thr Met 








1685 








1690 








1695 


Lys 


Ser 


Val 


Val Cys 


Lys 


Met 


Asn 


Pro Met 


Thr 


Asp 


Ala 


Ala Ser 






1700 








1705 








1710 
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Cys Gly Ser Glu Val Lys Lys Trp 
1715 

Glu Ser Asp Glu Ser Gly Asp Asp 
1730 

<210> 29 

<211> 547 

<212> PRT 
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<220> 
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<400> 29 



Met 


Ala 


Phe 


Gin 


Asp 


Leu 


Leu 


Asp 


1 








5 








Phe 


Gin 


He 


Leu 


Gin 
20 


Met 


Val 


Phe 


Val 


Tyr 


His 


Gin 


Thr 
35 


Gin 


Leu 


Glu 


Asp 


His 


Arg 


Cys 


Trp 
50 


Val 


His 


He 


Asp 


Asn 


Asp 


Pro 


Gly 

65 


Thr 


Leu 


Ser 


Ser 


He 


Pro 


Phe 


Asp 
80 


Ser 


Asn 


Leu 


Phe 


Val 


His 


Pro 


Gin 
95 


Trp 


Lys 


Leu 


Pro 


Asn 


Thr 


Ser 


GlU 
110 


Pro 


Asp 


Thr 


Val 


Tyr 


Asp 


Gin 


Ser 
125 


Ser 


Phe 


Pro 


Asp 


Leu 


Val 


Cys 


Glu 
140 


Ser 


Gin 


Pro 


Leu 


Phe 


Met 


Ala 


Gly 
155 


Met 


Met 


Val 


Leu 


Ser 


Asp 


Arg 


Phe 


Gly Arg 


Lys 










17 0 








Leu 


Gin 


Leu 


Ala 


He 
185 


Val 


Gly 


Thr 


He 


Leu 


Val 


Tyr 


Cys 
200 


Ser 


Leu 


Arg 


Phe 


Ser 


He 


He 


Val 
215 


Asn 


Thr 


Val 


Thr 


His 


Gin 


Phe 


Cys 
230 


Ala 


Met 


Ala 


Ala 


Ser 


He Gly 


His 


He 


Thr 


Leu 










245 








Arg 


Asp 


Gin 


Cys 


He 
260 


Leu 


Gin 


Leu 


Val 


Phe 


Phe 


Leu 


Phe 
275 


Ser 


Arg 


Trp 


Leu 


He 


He 


Asn 


Asn 
290 


Lys 


Pro 


Glu 


Lys 


Ala 


Ala 


His 


Arg 


Asn Gly Met 










305 








Thr 


Met 


Glu 


Val 


Leu 

320 


Lys 


Ser 


Thr 


Ala 


Gin 


Lys 


Lys 


His 
335 


Ser 


Leu 


Cys 


He 


Cys 


Lys 


Arg 


He 
350 


Cys 


Phe 


Leu 


Thr 


He 


Pro 


Phe 


Trp 


Gly Leu 


Thr 










365 








Asn 


Asn 


Val 


Phe 


Leu 
380 


Leu 


Gin 


Thr 



Trp 


Thr 


Arg 


Gin 


Leu 


Thr 


Val 


1720 






1725 


Leu 


Leu 


Asp 


He 








1735 












pin 


val 


Gly Gly 


Leu 


Gly 


Arg 














15 


Leu 


11c 


Met 


Phe 


Asn 


Val 


He 
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30 


Asn 


irne 


Ala 


Ala 


Phe 


He 


Leu 




AO 










45 


Leu 


Asp 


Asn 


Asp 


Thr 


He 


Pro 




3 3 










60 


turn 


Asp 


Ala 


Leu 


Leu 


Arg 


He 




n n 










75 


Arg 


Pro 


Glu 


Lys 


Cys 


Arg 
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o D 










90 
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rilS 


Leu 


Asn 


Gly Thr 


Phe 
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Cys 


Val 
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Gly 


Trp 
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120 


Ser 


Thr 


He 


Val 


Thr 


Lys 


Trp 




IjU 










135 


Leu 


Asn 


Ser 


Val 


Ala 


Lys 


Phe 




1 
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Asn 


Leu 


Tyr 


Gly 


His 
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165 


rile 


Val 


Leu 


Arg 


Trp 


Ser 


Tyr 




175 
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180 


Cys 




Ala 


Phe 


Ala 


Pro 


Thr 




190 










195 


Phe 


Leu 


Ala 


Gly 


Ala 


Ala 


Thr 




205 
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Gin 




955 










960 


Asp 


He 


Phe 


Ser 


Val 


Ser 


Arg 




970 










975 


Asp 


Glu 


He 


His 


Phe 








985 












Gin 


Val 


Gly 


Gly 


Leu 


Gly 


Arg 




10 










15 


He 


Leu 


Pro 


Ser 


Leu 


Met 


Leu 




25 










30 


Asn 


Phe 


Ala 


Ala 


Ala 


He 


Pro 




40 










45 


Leu 


Asp 


Asn 


Asn 


Thr 


Gly 


Ser 
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50 55 60 



Gly 


Asn 


Glu 


Thr 


Gly 


He 


Leu 


Ser 


Glu 


Asp 


Ala 


Leu 


Leu 


Arg 


He 










65 










70 










75 


Ser 


He 


Pro 


Leu 


Asp 


Ser 


Asn 


Leu 


Arg 


Pro 


Glu 


Lys 


Cys 


Arg 


Arg 










80 










85 










90 


Phe 


Val 


His 


Pro 


Gin 


Trp 


Gin 


Leu 


Leu 


His 


Leu 


Asn 


Gly 


Thr 


He 










95 










100 










105 


His 


Ser 


Thr 


Ser 


Glu 


Ala 


Asp 


Thr 


Glu 


Pro 


Cys 


Val 


Asp Gly Trp 










110 










115 










120 


Val 


Tyr 


Asp 


Gin 


Ser 


Tyr 


Phe 


Pro 


Ser 


Thr 


He 


Val 


Thr 


Lys 


Trp 










125 










130 










135 


Asp 


Leu 


Val 


Cys 


Asp 


Tyr 


Gin 


Ser 


Leu 


Lys 


Ser 


Val 


Val 


Gin 


Phe 










140 










145 










150 


Leu 


Leu 


Leu 


Thr 


Gly 


Met 


Leu 


Val 


Gly 


Gly 


He 


He 


Gly Gly 


His 










155 










160 










165 


Val 


Ser 


Asp 


Arg 


Phe 


Gly 


Arg 


Arg 


Phe 


He 


Leu 


Arg 


Trp 


Cys 


Leu 










170 










175 










180 


Leu 


Gin 


Leu 


Ala 


He 


Thr 


Asp 


Thr 


Cys 


Ala 


Ala 


Phe 


Ala 


Pro 


Thr 










185 










190 










195 


Phe 


Pro 


Val 


Tyr 


Cys 


Val 


Leu 


Arg 


Phe 


Leu 


Ala 


Gly 


Phe 


Ser 


Ser 










200 










205 










210 


Met 


He 


He 


He 


Ser 


Asn 


Asn 


Ser 


Leu 


Pro 


He 


Thr 


Glu 


Trp 


He 










215 










220 










225 


Arg 


Pro 


Asn 


Ser 


Lys 


Ala 


Leu 


Val 


Val 


He 


Leu 


Ser 


Ser 


Gly 


Ala 










230 










235 










240 


Leu 


Ser 


He 


Gly 


Gin 


He 


He 


Leu 


Gly 


Gly 


Leu 


Ala 


Tyr 


Val 


Phe 










245 










250 










255 


Arg 


Asp 


Trp 


Gin 


Thr 


Leu 


His 


Val 


Val 


Ala 


Ser 


Val 


Pro 


Phe 


Phe 






260 










265 










270 


Val 


Phe 


Phe 


Leu 


Leu 


Ser 


Arg 


Trp 


Leu 


Val 


Glu 


Ser 


Ala 


Arg 


Trp 










275 










280 










285 


Leu 


He 


He 


Thr 


Asn 


Lys 


Leu 


Asp 


Glu 


Gly 


Leu 


Lys 


Ala 


Leu 


Arg 










290 










295 










300 


Lys 


Val 


Ala 


Arg 


Thr 


Asn 


Gly 


He 


Lys 


Asn 


Ala 


Glu 


Glu 


Thr 


Leu 










305 










310 










315 


Asn 


He 


Glu 


Val 


Val 


Arg 


Ser 


Thr 


Met 


Gin 


Glu 


Glu 


Leu 


Asp 


Ala 










320 








325 










330 


Ala 


Gin 


Thr 


Lys 


Thr 


Thr 


Val 


Cys 


Asp 


Leu 


Phe 


Arg 


Asn 


Pro 


Ser 










335 










340 










345 


Met 


Arg 


Lys 


Arg 


He 


Cys 


He 


Leu 


Val 


Phe 


Leu 


Arg 


Phe 


Ala 


Asn 










350 










355 










360 


Thr 


He 


Pro 


Phe 


Tyr 


Gly 


Thr 


Met 


Val 


Asn 


Leu 


Gin 


His 


Val 


Gly 










365 










370 










375 


Ser 


Asn 


He 


Phe 


Leu 


Leu 


Gin 


Val 


Leu 


Tyr 


Gly Ala 


Val 


Ala 


Leu 










380 










385 










390 


He 


Val 


Arg 


Cys 


Leu 


Ala 


Leu 


Leu 


Thr 


Leu 


Asn 


His 


Met 


Gly 


Arg 










395 










400 










405 


Arg 


He 


Ser 


Gin 


He 


Leu 


Phe 


Met 


Phe 


Leu 


Val 


Gly 


Leu 


Ser 


He 










410 










415 










420 


Leu 


Ala 


Asn 


Thr 


Phe 


Val 


Pro 


Lys 


Glu 


Met 


Gin 


Thr 


Leu Arg 


Val 










425 










43 0 










435 


Ala 


Leu 


Ala 


Cys 


Leu 


Gly 


He 


Gly 


Cys 


Ser 


Ala 


Ala 


Thr 


Phe 


Ser 










440 










445 










450 


Ser 


Val 


Ala 


Val 


His 


Phe 


He 


Glu 


Leu 


He 


Pro 


Thr 


Val 


Leu 


Arg 










455 










460 










465 


Ala 


Arg 


Ala 


Ser 


Gly 


lie 


Asp 


Leu 


Thr 


Ala 


Ser 


Arg 


He Gly 


Ala 










470 










475 










480 


Ala 


Leu 


Ala 


Pro 


Leu 


Leu 


Met 


Thr 


Leu 


Thr 


Val 


Phe 


Phe 


Thr 


Thr 










485 










490 










495 


Leu 


Pro 


Trp 


He 


He 


Tyr 


Gly 


He 


Phe 


Pro 


He 


He 


Gly Gly 


Leu 










500 










505 










510 


He 


Val 


Phe 


Leu 


Leu 


Pro 


Glu 


Thr 


Lys 


Asn 


Leu 


Pro 


Leu 


Pro 


Asp 










515 








520 










525 


Thr 


He 


Lys 


Asp 


Val 


Glu 


Asn 


Gin 

















530 
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<211> 1775 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 3474673CB1 

<400> 33 

atttcaggaa atgtgagggg gctctgggcc ccttccctca gcgcctgcgg tcacccagca 60 
gcttcctcct tctccctggc ctaggcctag cagg-tgggca ccccgcacac atttgaggcg 120 
gggccagatg cccacagttc agagcctctt tttgtcccgg ggattggatc ccagggctgg 180 
gtggggccag gctgtcccat tccccaacac tcctcctccc cggcgaaacc gggcaccagc 240 
aggcgtttgc gagaggagat acgagctgga cgcctggccc ttccctccca ccgggtccta 3 00 
gtccaccgct cccggcgccg gctccccgcc tctcccgcta tgtaccgacc gcgagcccgg 3 60 
gcggctcccg agggcagggt ccggggctgc gcggtgccca gcaccgtgct cctgctgctc 420 
gcctacctgg cttacctggc gctgggcacc ggcgtgttct ggacgctgga gggccgcgcg 480 
gcgcaggact ccagccgcag cttccagcgc gacaagtggg agctgttgca gaacttcacg 540 
tgtctggacc gcccggcgct ggactcgctg atccgggatg tcgtccaagc atacaaaaac 600 
ggagccagcc tcctcagcaa caccaccagc atggggcgct gggagctcgt gggctccttc 660 
ttcttttctg tgtccaccat caccaccatt ggctatggca acctgagccc caacacgatg 720 
gctgcccgcc tcttctgcat cttctttgcc cttgtgggga tcccactcaa cctcgtggtg 7 80 
ctcaaccgac tggggcatct catgcagcag ggagtaaacc actgggccag caggctgggg 840 
ggcacctggc aggatcctga caaggcgcgg tggctggcgg gctctggcgc cctcctctcg 9 00 
ggcctcctgc tcttcctgct gctgccaccg ctgctcttct cccacatgga gggctggagc 960 
tacacagagg gcttctactt cgccttcatc accctcagca ccgtgggctt cggcgactac 1020 
gtgattggaa tgaacccctc ccagaggtac ccactgtggt acaagaacat ggtgtccctg 1080 
tggatcctct ttgggatggc atggctggcc ttgatcatca aactcatcct ctcccagctg 1140 
gagacgccag ggagggtatg ttcctgctgc caccacagct ctaaggaaga cttcaagtcc 12 00 
caaagctgga gacagggacc tgaccgggag ccagagtccc actccccaca gcaaggatgc 1260 
tatccagagg gacccatggg aatcatacag catctggaac cttctgctca cgctgcaggc 132 0 
tgtggcaagg acagctagtt atactccatt ctttggtcgt cgtcctcggt agcaagaccc 1380 
ctgattttaa gctttgcaca tgtccaccca aactaaagac tacattttcc atccacccta 1440 
gaggctgggt gcagctatat gattaattct gcccaatagg gtatacagag acatgtcctg 1500 
ggtgacatgg gatgtgactt tcgggtgtcg gggcagcatg cccttctccc ccacttcctt 1560 
actttagcgg gctgcaatgc cgccgatatg atggctggga gctctggcag ccatacggca 1620 
ccatgaagta gcggcaatgt ttgagcggca caataagata ggaagagtct ggatctctga 1680 
tgatcacaga gccatcctaa caaacggaat atcacccgac ctcctttatg tgagagagaa 1740 
ataaacatct tatgtaaaat accaaaaaaa aaaaa 1775 

<210> 34 

<211> 1545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4588877CB1 

<400> 34 

aatgagggcc ctgggggttg ggcccaggag tggggctgtg gtggtgagtg gacagggctg 6 0 
ggctggaaat gtcccctgag tgccccctct cacctcaggc tatggcaacc tgagccccaa 120 
cacgatggct gcccgcctct tctgcatctt ctttgccctt gtggggatcc cactcaacct 180 
cgtggtgctc aaccgactgg ggcatctcat gcagcaggga gtaaaccact gggccagcag 240 
gctggggggc' acctggcagg tgagggggct gctggacggg gtggggatgg gtcacttcta 3 00 
gaatgagggg ctgtggtggg aattggggtt actaatgaca agaggtggga gcaagtgtta 3 60 
ctggtgaggt tgtgttggga ttgggggtca ctgctcagaa tagggtcctt agtgaaaaag 420 
ggcattaatg gtggagatgg ggtgggactg ggcagacagg aaggacatga ggcacaggct 480 
ccaggcaggg aacctggaga acacagacca ggtgaagagc ccccttctta ctggggacag 540 
ctctggcctg cctccagctc cctcggctcc cacgcatggg gtgaaggcct caggaggcct 600 
ggggacaata ttgcacccac aggatcctga caaggcgcgg tggctggcgg gctctggcgc 660 
cctcctctcg ggcctcctgc tcttcctgct gctgccaccg ctgctcttct cccacatgga 720 
gggctggagc tacacagagg gcttctactt cgccttcatc accctcagca ccgtgggctt 780 
cggcgactac gtgattggaa tgaacccctc ccagaggtac ccactgtggt acaagaacat 840 
ggtgtccctg tggatcctct ttgggatggc atggctggcc ttgatcatca aactcatcct 9 00 
ctcccagctg gagacgccag ggagggtatg ttcctgctgc caccacagct ctaaggaaga 9 60 
cttcaagtcc caaagctgga gacagggacc tgaccgggag ccagagtccc actccccaca 1020 
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gcaaggatgc 
cgctgcaggc 
agcaagac cc 
atccacccta 
acatgtcctg 
ccacttcctt 
ccatacggca 
ggatctctga 
tgagagagaa 



tatccagagg 
tgtggcaagg 
ctgattttaa 
gaggctgggt 
ggtgacatgg 
actttagcgg 
ccatgaagta 
tgatcacaga 
ataaacatct 



gacccatggg 
acagctagtt 
gctttgcaca 
gcagctatat 
gatgtgactt 
gctgcaatgc 
gcggcaatgt 
gccatcctaa 
tatgtaaaat 



aatcatacag 
atactccatt 
tgtccaccca 
gattaattct 
tcgggtgtcg 
cgccgatatg 
ttgagcggca 
caaacggaat 
accaaaaaaa 



catctggaac 
ctttggtcgt 
aactaaagac 
gcccaatagg 
gggcagcatg 
atggctggga 
caataagata 
atcacccgac 
aaaaa 



cttctgctca 
cgtcctcggt 
tacattttcc 
gtatacagag 
cccttctccc 
gctctggcag 
ggaagagtct 
ctcctttatg 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1545 



<210> 35 
<211> 1941 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mi sc__f eature 
<223> Incyte ID No: 



7472214CB1 



<400> 35 

atggcggaga 

gccgtgacgc 

gagaaccaca 

atcgagttcg 

tataagaccc 

atgggcccct 

tctggaatga 

aagatgtcct 

gccatgatgg 

gtgacagaga 

ctctctggcg 

gtcatgttct 

tccctcatga 

agtgccaagc 

ttcaaaggcg 

acctaccaca 

aaccccatgt 

agccctgaga 

attgaaagcc 

accttcctgt 

gttattggcg 

ttcaacaaca 

ccaactgtgc 

tggtacagcc 

gtgtgtccgg 

agccgcttcc 

gggctgctga 

accgccatcc 

tacctgcaat 

acgatctatg 

gagccacaga 

ctggtcttgg 

cgggtcaagt 



aggcgctgga 
tggaggacgg 
tcactgaagc 
tggagctgtc 
ttctcaagtg 
caggggctgg 
aggggcagat 
gctacatcat 
tgtctgctaa 
tcctgacggc 
ggcagaggaa 
ttgatgagcc 
agtccctggc 
tctttgagat 
tggtcaccaa 
acccggctga 
tgttcagggc 
agaacgaggt 
acacctttgc 
ccatcctcag 
tgctcatcgg 
ccggctgcct 
tcaccgtccc 
tcaaagcgta 
tggtctactg 
tgctcttctc 
tcggagctgc 
ctgtcctctt 
ggagctccta 
gcatggagcg 
gcatcctccg 
gcatcttctt 
cagagagata 



ggccgtgggc 
ggcggaaccc 
ccagcgcttc 
ctattccgtg 
cctctcaggt 
caagtctaca 
cctggttaat 
gcaagatgac 
cctgaatctt 
actgggcctg 
gcgtctggcc 
caccagtggt 
acaggggggc 
gtttgacaag 
cctgatcccc 
cttcgtcatc 
tgtgcagaat 
ccctgcccca 
caccagcacc 
ggacacggtg 
cctcctctac 
cttcttctcc 
cttagagatg 
ttacctggcc 
cagcattgtg 
agccctggcc 
ttccaactcc 
gttctccggc 
tctctcctat 
aggagacctg 
agcgctggat 
cctagccctg 

g 



tgtggactag 
cc tgtgctga 
tcccacctac 
cgggaggggc 
aaattctgcc 
ttcatgaaca 
ggaaggccac 
atgctgctgc 
actgagaatc 
atgtcgtgct 
atcgccctgg 
ctggatagcg 
cgtaccatca 
ctctacatcc 
tatctaaagg 
gaggtggcct 
gggctgtgcg 
tgccctcctt 
ctcacacagt 
ctgacccacc 
ctgcatattg 
atgctgttcc 
gcggtcttca 
aagaccatgg 
tactggatga 
accgccaccg 
ctacaggtgg 
ttctttgtca 
gtcaggtatg 
acatgtttag 
gtggaggatg 
cggctgctgg 



ggccgggggc 
ccacgcacct 
ccaagcgctc 
cctgctggcg 
gccgggagct 
tcttggcagg 
gggagc tgag 
cgcacctcac 
ccgatgtgaa 
cccacacgag 
agctggtcaa 
cctcttgttt 
tctgcaccat 
tgagccaggg 
gactcggctt 
ctggcgagta 
ctatggctga 
gtcctccgga 
tctgcatcct 
tacggt teat 
gcgacgatgc 
tcatgttcgc 
tgagggagca 
ctgacgtgcc 
cgggccagcc 
ccttggtggc 
ccacttttgt 
gcttcaagac 
gctttgaggg 
aggaaegctg 
ccaagctcta 
cctaccttgt 



tgtggccatg 
gaagaaggtg 
agccgtggac 
caaaaggggt 
gattggcatc 
atacagggag 
gaccttccgc 
ggtgttggaa 
aaacgatctc 
gacagccctg 
caacccgcct 
ccaagtggtg 
ccaccagccc 
teagtgeate 
gcattgcccc 
tggagacctg 
gaagaagagc 
agtggatccc 
cttcaagagg 
gtcccacgtg 
cagcaaggtc 
cgccctcatg 
cctcaactac 
ctttcaggtg 
cgctgagacc 
ccaatctttg 
gggcccagtt 
catccccact 
tgtgatcctg 
cccgttccgg 
catggacttc 
getgegttae 



<210> 36 
<211> 4971 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1941 



7473053CB1 



<400> 36 

caaagtagcg ggccgaggcc egggggageg gggccgcagc tgggggggcg ggagcccgtg 60 

gggagecgag ccgagcgccc cccgccccag cccccggcat gggcagtacg gggccgccgg 12 0 

ggcgggcgcc gagegctgag cgctgagggt ctcccatggg attgctggga tcttgctggg 180 
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tgagatggca gtgtgtgcaa aaaagcgccc 
tgaccgagaa tacaatgaga aattccagta 
caatattctc accttcctgc ctgtcaacct 
ttacttcctg ttcctcctca ttctgcagtt 
caccaccatt gtgcctttgg ttcttgtcct 
tgactatttc cgccacaaga gcgataacca 
caacggaatc ctccagcagg agcagtggat 
agaaaataac cagtttgtgg cggcggatct 
gctgtgttac atagagacag cagaacttga 
gattccagtc acctcagaat tgggagacat 
gatctgtgaa cctcccaaca acaaactgga 
aaataagttc cctctgagca accagaacat 
cgagtggtgc ttcgggctgg tcatctttgc 
cggcagaaca aagttcaaaa gaacgagtat 
gatttttgga ttcctggttt gcatgggggt 
gcacgaggtg gggatgcgtt tccaggtcta 
cttcttctct ggcttcctct ccttctggtc 
catttcactc tatgtcagtg tggaggtcat 
ggataagaag atgttctgca tgaagaagcg 
aaacgaggag ctgggccagg tggagtacat 
gaacatcatg gttttcaaca agtgctccat 
cgccctggga cacaaagctg aattgggaga 
tcctctggct gacaagaagt tcttattttg 
cggggacccc cacacgcatg agttcttccg 
agaagaaaag aacgaaggag agctgtacta 
ggtcaccgca gccaggaact ttggttttgt 
cgtccatgag atgggcacag ccatcaccta 
catccgcaag cggatgtcgg tcatagtgcg 
caaaggggct gacactatcc tactggacag 
caccaccatg gaccacctta atgagtacgc 
ctacaaggat ctggatgaag agtactatga 
cctggcccag gacagccggg aggacaggct 
catgatgctg ctgggtgcaa cggccattga 
cattgccctc ctgacactgg ccaacatcaa 
gacggctgtg aacatcggct attcctgcaa 
catagtcact ggccatactg tcctggaggt 
gatgatggac tcatcccgct ctgtaggcaa 
ttccaagcta acttctgtcc tggaggccgt 
tcacagcctg gcccacgcac tggaggcaga 
tgcctgcaaa gctgtcatct gctgccgggt 
actggtcaag aagtacaaga aggctgtgac 
cagcatgatc aaaacggctc acattggtgt 
tgtcttggcc tccgattact ccttctccca 
gcatgggcgc tggtcctacc tgcgaatgtg 
ctttgctttc accatggtcc acttctggtt 
cgtctatgac cagtatttca tcaccctgta 
ggctatgggg gtctttgatc aggatgtccc 
gtatgagccg ggccagctga accttctctt 
ccagggcatc tacacctccg tgctcatgtt 
cacccgggat gatggcactc agctggctga 
atccttggtc attgtggtta gcgtgcagat 
caaccacttc ttcatctggg gaagccttgc 
cagcaatggg ctcttcgaca tgtttcccaa 
caccttggcc cagcccacgg tgtggctgac 
gcccgtggtt gccttccgat tcctcaggct 
ccgctacaca cagctcgtga ggaagaagca 
tggccgcact ggctcccggc gctccggcta 
gctcatcatg tctggcaaga acatgcggct 
ccgctccagc tccagctgga ttgagagcct 
ccccagtggc ggtgccgaca agcccctcaa 
ccagtgacca gagcacccag ggctggccag 
ggtcctcatt ccttgcttcc cgtccccccg 
atggctggga catctgttcc cagctgtagg 
caggcccaag ggcagagcag aggctgaggc 
agtggaacca aaaacaagaa aaaactgtga 
acccacaggg agactataat ctccttattt 
tgcctctgtt cctgaattac ataagaatgt 



cccagaagaa gaaaggaggg cgcgggctaa 240 
tgcgagtaac tgcatcaaga cc tccaagta 3 00 
ctttgagcag ttccaggaag ttgccaacac 3 60 
gatcccccag atctcttccc tgtcctggtt 420 
caeca tcaca gctgttaaag atgccactga 480 
ggtgaataac cgccagtctc aggtgetgat 540 
gaatgtctgt gttggtgata ttatcaagct 600 
cctcctcctt tccagcagtg agccccatgg 660 
tggegagace aacatgaaag taegtcagge 72 0 
cagtaagctt gccaagtttg acggtgaagt 780 
caaattcagc ggaaccctct actggaagga 840 
getgetgegg ggctgtgtgc tgegaaacac 900 
aggtcccgac actaagctga tgcaaaacag 9 60 
cgatcgccta atgaataccc tggtgctctg 102 0 
gatcctggcc attggcaatg ccatctggga 108 0 
cc tgccgtgg gatgaggcag tggacagtgc 1140 
ctacatcatc atcctcaaca ccgttgtgcc 1200 
ccgtctgggc cacagctact tcatcaactg 12 60 
gacgcctgca gaagcccgca ccaccaccct 13 2 0 
cttctccgac aagaegggea ccctcaccca 1380 
caatggccac agctatggtg atgtgtttga 1440 
gaggectgaa cctgttgact tctccttcaa 1500 
ggaccccagc ctgctggagg ctgtcaagat 1560 
cctcctttcc ctgtgtcata ctgtcatgtc 1620 
caaagctcag tccccagatg agggggcect 1680 
tttccgctct cgcaccccca aaacaatcac 174 0 
ccagctgctg gccatcctgg acttcaacaa 18 00 
gaatccagag gggaagatcc gactctactg 1860 
actgcaccac tccactcaag agetgetcaa 1920 
aggggaaggg .ctgaggaccc tggtgctggc 19 80 
ggagtgggct gagcgacgcc tccaggccag 2040 
ggctagcatc tatgaggagg ttgagaacaa 2100 
ggacaaactt cagcaagggg ttccagagac 2160 
gatttgggtg etaaceggag acaagcaaga 222 0 
gatgetgacg gatgacatga ctgaggtttt 22 8 0 
gegggaggag ctcaggaaag cccgggagaa 23 4 0 
cggcttcacc tatcaggaca agctttcttc 2400 
tgctggggag tacgccctgg tcataaatgg 24 60 
catggagctg gagtttctgg agacagegtg 252 0 
gacccccttg cagaaggcac aggtggtaga 2580 
gettgecatt ggagaeggag ccaatgatgt 2640 
ggggatcagt gggcaggaag ggatccaggc 27 00 
gttcaagttc ctgcagcgcc tcctgctggt 2760 
caagtttctt tgetatttet tctacaaaaa 282 0 
tggcttcttc tgtggcttct cagcccagac 2880 
taacatcgtg tacacctccc tgccagtcct 2940 
egagcagegg agcatggagt accctaagct 3 000 
caacaagegg gagttcttca tctgcatcgc 3 060 
cttcattccc tatggggtgt ttgctgatgc 312 0 
ctaccagtcc tttgeagtea ctgtggccac 3180 
tgggctcgac acaggctact ggaeggecat 324 0 
tgtttacttt gccatcctct ttgecatgea 33 00 
ccagttccgg tttgtgggga atgcccagaa 33 60 
cattgtgctc accacagtcg tetgeatcat 342 0 
caacctgaag ccggatctct ccgacacggt 3480 
gaaggcccag caccgctgca tgcggcgggt 3540 
tgccttctcc catcaggagg gettegggga 3 600 
gagctctctc gcgctctcca gcttcaccac 3 660 
gcgcaggaag aagagtgaca gtgecagtag 3720 
gggctgaagg ccgaggatgg atgccctgtg 3780 
tcactgaggg aacagegtet eggaactget 3840 
gtagactctg tcctgctggt cccaccacac 3 900 
cccttccacc agctggggag ctagagggag 3 9 60 
aeggggagee agccccactc ggggaccaga 4020 
gagattgtgt ctgcccctgc cctgcctggg 4080 
ttttactcct actccccaga ggggecctag 4140 
accatgccgg gaagecagag acctgeaggg 4200 
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gcctcggccc ctcacatcgt gtatgtctct ccttgatttg tgttgtgtcc agtttggttt 4260 

tgtctttttt tatttggcaa gtggaggagg cttttatgtg acttttatgt tgtggttggt 4320 

gtcttaactc tcctgggaaa aggaggctgg cacacactgg gatgccgcag cctggccggc 4380 

tgtggggtgg tttgggagga tccatgtcgg ctctgcctgc agtgaccagt gctctgtggg 4440 

gcagaggagc tgaccaggga gggaggtacc catgagcaga gggtagtggg agagtgtaaa 4500 

ggagggtttg gtcctgtctg cttcctcacc ttgagagtaa agtgctgccc tctgccccca 4560 

acacacacac atatcaattc ctggattcct tagtcctgct ggccttgggc tggagcctag 4620 

gaaagtggcc cccaaatcct tagtgagcta aagctgggtc tgaaatttgg tcagtgggga 4680 

ggggtagttt tcttttcttt tttctttttc tttttttctt tttttttttg agatggagtc 4740 

tcactcttgt cacctaggca agagtgcaat ggcacaatct cagctcactg caacctccac 4800 

ctcctgggtt caagcgattc tcctgcctcc ccggacccaa ccactggact taatctcact 4860 

ttcttaaatt cttctattct cagacacggg tctagtacca ttccttcctc ttagccccag 4920 

ggagcaaatt aaagaggtta cgagttaaaa tcctaaaaaa aaaaaaaaaa a 4971 

<210> 37 
<211> 1404 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 7473347CB1 

<400> 37 

atggtcctgg ctttccagtt agtctccttc acctacatct ggatcatatt gaaaccaaat 60 

gtttgtgctg cttctaacat caagatgaca caccagcggt gctcctcttc aatgaaacaa 120 

acctgcaaac aagaaactag aatgaagaaa gatgacagta ccaaagcgcg gcctcagaaa 180 

tatgagcaac ttctccatat agaggacaac gatttcgcaa tgagacctgg atttggaggg 240 

tctccagtgc cagtaggtat agatgtccat gttgaaagca ttgacagcat ttcagagact 3 00 

aacatggact ttacaatgac tttttatctc aggcattact ggaaagacga gaggctctcc 3 60 

tttcctagca cagcaaacaa aagcatgaca tttgatcata gattgaccag aaagatctgg 42 0 

gtgcctgata tcttttttgt ccactctaaa agatccttca tccatgatac aactatggag 480 

aatatcatgc tgcgcgtaca ccc tgatgga aacgtcctcc taagtctcag gataacggtt 54 0 

tcggccatgt gctttatgga tttcagcagg tttcctcttg acactcaaaa ttgttctctt 600 

gaactggaaa gctatgccta caatgaggat gacctaatgc tatactggaa acacggaaac 660 

aagtccttaa atactgaaga acatatgtcc ctttctcagt tcttcattga agacttcagt 720 

gcatctagtg gattagcttt ctatagcagc acaggctggt acaataggct tttcatcatc 780 

tctgtgctaa ggaggcatgt tttcttcttt gtgctgccaa cctattaccc agccatattg 840 

atggtgatgc tttcatgggt ttcattttgg attgaccgaa gagctgttcc tgcaagagtt 900 

tccctgggaa tcaccacagt gctgaccatg tccacaatca tcactgctgt gagcgcctcc 9 60 

atgccccagg tgtcctacct caaggctgtg gatgtgtacc tgtgggtcag ctccctcttt 1020 

gtgttcctgt cagtcattga gtatgcagct gtgaactacc tcaccacagt ggaagagcgg 1080 

aaacaattca agaagacagg aaagatttct aggatgtaca atattgatgc agttcaagct 1140 

atggcctttg atggttgtta ccatgacagc gagattgaca tggaccagac ttccctctct 1200 

ctaaactcag aagacttcat gagaagaaaa tcgatatgca gccccagcac cgattcatct 1260 

cggataaaga gaagaaaatc cctaggagga catgttggta gaatcattct ggaaaacaac 132 0 

catgtcattg acacctattc taggatttta ttccccattg tgtatatttt atttaatttg 1380 
ttttactggg gtgtatatgt atga 1404 

<210> 38 

<211> 4048 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 7474240CB1 

<400> 38 

cttccatccc ccctcagcca ttccttactg ctctgggcaa ccgccaggtt aagcccattt 60 

gcactgggaa attggcgctg tttgggagaa gagaaacaga tcgattgccc ttgtgactcc 120 

ccgccccctt cccatcccca cccccaccgc tctctccctc tttccctccc ccgccacctc 180 

ccctcacccc gcctccttcc cgttccccac ccccaaaccc tctcacccgc ggcagtccgg 240 

tgcgaggccc cctccggaag gtgaggggaa tggattggac tccggtggag aaagcgggtg 3 00 

tctagaagtg gtgctaatgg gaagagaatt ctggtttcaa aagaggatgc tctgccacaa 360 

agagcggctc gcgcgctggc ctgggctcta gccgaggaga gatcccggga ggactccaga 42 0 

gctccggggg agcgctcctc ggaagaccgg ggccaacatg cctgtgcgca gggggcatgt 480 
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ggcaccacaa aatacatttc tggggaccat cattcggaaa tttgaagggc aaaataaaaa 540 
atttatcatt gcaaatgcca gagtgcagaa ctgtgccatc atttattgca acgatgggtt 600 
ctgtgagatg actggtttct ccaggccaga tgtcatgcaa aagccatgca cctgcgactt 66 0 
tctccatgga cccgagacca agaggcatga tattgcccaa attgcccagg cattgctggg 720 
gtcagaagag aggaaagtgg aggtcaccta ctatcacaaa aatgggtcca cttttatttg 780 
taacactcac ataattccag tgaaaaacca agagggcgtg gctatgatgt tcatcattaa 84 0 
ttttgaatat gtgacggata atgaaaacgc tgccacccca gagagggtaa acccaatatt 9 00 
accaatcaaa actgtaaacc ggaaattttt tgggttcaaa ttccctggtc tgagagttct 9 60 
cacttacaga aagcagtcct taccacaaga agaccccgat gtggtggtca tcgattcatc 1020 
taaacacagt gatgattcag tagccatgaa gcattttaag tctcctacaa aagaaagctg 1080 
cagcccctct gaagcagatg acacaaaagc tttgatacag cccagcaaat gttctccctt 1140 
ggtgaatata tccggacctc ttgaccattc ctctcccaaa aggcaatggg accgactcta 1200 
ccctgacatg ctgcagtcaa gttcccagct gtcccattcc agatcaaggg aaagcttatg 12 60 
tagtatacgg agagcatctt cggtccatga tatagaagga ttcggcgtcc accccaagaa 132 0 
catatttaga gaccgacatg ccagcgaaga caatggtcgc aatgtcaaag ggccttttaa 13 80 
tcatatcaag tcaagcctcc tgggatccac atcagattca aacctcaaca aatacagcac 1440 
cattaacaag attccacagc tcactctgaa tttttcagag gtcaaaactg agaaaaagaa 150 0 
ttcatcacct ccttcttcag ataaaaccat tattgcaccc aaggttaaag atcgaacaca 1560 
caatgtgact gagaaagtga cccaggttct ctctttagga gcagatgtcc tacctgaata 162 0 
caaactgcag acaccacgca tcaacaagtt tacgatattg cactacagcc ctttcaaggc 1680 
agtctgggac tggcttatcc tgctgttggt catatacact gctatattta ctccctactc 1740 
tgcagccttc ctcctcaatg acagagaaga acagaaaaga cgagaatgtg gctattcttg 1800 
tagccctttg aatgtggtag acttgattgt ggatattatg tttatcatag atattttaat 1860 
aaacttcaga acaacatatg taaatcagaa tgaagaagtg gtaagtgatc ccgccaaaat 1920 
agcaatacac tacttcaaag gctggttcct gattgacatg gttgcagcaa ttccttttga 19 80 
cttgctgatt tttggatcag gttctgatga gacaacaaca ttaattggtc ttttgaagac 2040 
tgcccgactc ctccgtcttg tgcgcgtggc caggaaactg gatcgatatt cagaatatgg 2100 
cgctgctgtt ctaatgctct taatgtgcat ctttgccctg attgctcact ggctggcttg 2160 
catttggtat gcgattggga atgtagaaag gccttacctg actgacaaaa tcggatggtt 2220 
ggattcctta ggacagcaaa ttgggaaacg ttacaatgac agtgactcaa gttctggacc 2280 
atccattaaa gacaaatacg tcacagcact ttattttacc ttcagcagtt taaccagtgt 2340 
aggattcggg aatgtgtctc ctaacacgaa ttcggagaaa atcttttcaa tttgtgtcat 2400 
gttgattggc tcactaatgt atgcaagcat ttttgggaat gtatctgcaa ttatccaaag 2460 
actatactcg ggaactgcca ggtaccacat gcagatgctg cgagtaaaag agttcattcg 2520 
ctttcaccaa atccccaacc ctctgaggca acgtcttgaa gaatatttcc agcacgcatg 2580 
gacttacacc aatggcattg acatgaacat ggtcctaaag ggtttcccag aatgcttaca 2640 
agcagacatt tgtctacatc tcaaccagac attgctgcaa aactgcaaag cctttcgggg 27 00 
ggcaagtaaa ggttgcctta gagctttggc aatgaagttc aaaaccaccc atgcacctcc 27 60 
aggagacacc ctcgttcact gtggggatgt cctcactgca ctttatttct tatccagagg 2820 
ctccattgaa attctcaaag atgacattgt ggtggctatt ctgggaaaaa atgatatatt 2880 
tggagaaatg gttcatcttt atgccaaacc tggaaagtct aatgcagatg taagagccct 2940 
cacatactgt gacttgcata agattcagcg agaagacttg ttagaggttt tggatatgta 3 000 
tcctgagttt tctgatcact ttctaacaaa cctagagttg actttcaacc taaggcatga 3060 
gagcgcaaag gctgatctcc tacgatcaca atccatgaat gattcagaag gagacaactg 3120 
taaactaaga agaaggaaat tgtcatttga aagtgaagga gagaaagaaa acagtacaaa 3180 
tgatcctgaa gactctgcag ataccataag acattatcag agttccaaga gacactttga 3240 
agagaaaaaa agcagatcct catctttcat ctcctccatt gatgatgaac aaaagccgct 33 00 
cttctcagga atagtagact cttctccagg aatagggaaa gcatctgggc tcgattttga 33 60 
agaaacagtg cccacctcag gaagaatgca catagataaa agaagtcact cttgcaaaga 3420 
tatcactgac atgcgaagct gggaacgaga aaatgcacat ccccagcctg aagactccag 3480 
tccatctgca cttcagcgag ctgcctgggg tatctctgaa accgaaagcg acctcaccta 3 540 
cggggaagtg gaacaaagat tagatctgct ccaggagcaa cttaacaggc ttgaatccca 3 600 
aatgaccact gacatccaga ccatcttaca gttgctgcag aaacaaacca ctgtggtccc 3 660 
cccagcctac agtatggtaa cagcaggatc agaatatcag agacccatca tccagctgat 3720 
gagaaccagt caaccggaag catccatcaa aactgaccga agtttcagcc cttcctcaca 37 80 
atgtcctgaa tttctagacc ttgaaaaatc taaacttaaa tccaaagaat ccctttcaag 3840 
tggggtgcat ctgaacacag cttcagaaga caacttgact tcacttttaa aacaagacag 3 9 00 
tgatctctct ttagagcttc acctgcggca aagaaaaact tacgttcatc caattaggca 3960 
tccttctttg ccagattcat ccctaagcac tgtaggaatc gtgggtcttc ataggcatgt 4020 
ttctgatcct ggtcttccag ggaaataa 4048 

<210> 39 

<211> 1539 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> misc_feature 

<223> Incyte ID No: 7475338CB1 



<400> 39 

atggagaaca 

ccgacgctgt 

aacctctctg 

tttgagcgac 

gtctccatgt 

agctgcggca 

ctgatgggag 

ctgggagtct 

cccaagaacc 

ttcctagcac 

cttctggcgc 

gaaagccccc 

aggaggctga 

cgggccgagc 
cgctggcagc 
gcgatcaact 
caatatgtaa 
cttgtggagc 
gcctgcctgg 
ctcggcatca 
ccctcggtgg 
gacggggcag 
gaggccatcg 
tacatctacg 
tttgccaaga 
cctcccacag 



aagaggcggg 
tgctggcgac 
tggtcaacac 
acgcaacatt 
ttcctctggg 
ggaaggggac 
tcagcaaagt 
gtgcaggtat 
tgagaggcat 
agate ttcag 
tcacaggggt 
gctactccct 
gaggccacac 
gcgccgaggg 
tcctctccat 
actatgegga 
cggtgggctc 
ggctgggacg 
tgctgacggt 
tctgtgtctt 
tgaggaccga 
tgcactggct 
gtgcctacag 
tggttattcc 
gaaacagggt 
cctctcctgc 



aacccctcca 
actgagegeg 
gccgcacaag 
catggacggg 
cggcctgttg 
cctgctgatc 
ggecaagget 
ctcctacagc 
ggtgggaaca 
cctccaggcc 
gcccgccctg 
gattcagaaa 
ggacatggag 
ccacctgtct 
catcgtgctc 
caccatctac 
tggcgtcgtc 
gcggcacctc 
ggtgctccta 
tgcctacatc 
gatcttcctg 
caccaact tc 
tttcatcatc 
ggagaccaag 
gaagcttcca 
caaggaaact 



cccattccat 
gectttgget 
gtgttcaagt 
aagctcatgc 
gggtcattgc 
aacaacatct 
tttgagctga 
gcccttccca 
atgaccgagg 
atcttgggca 
ctgcagctgc 
ggagatgaag 
gecgagctgg 
gtgctgcacc 
atggccggcc 
acatctgegg 
aacatagtga 
ctgctggccg 
ttccagaaca 

gegggacatt 

cagtcctccc 
atcataggct 
tttgccggaa 
ggcaaaacat 
gaggagaaag 
tccttttag 



ccagggaggg 
cagccttcca 
cattttacaa 
tgcttctatg 
tcgtgggcct 
ttgecatcat 
tegtctttte 
tgtacctggg 
ttttegtcat 
acccggcagg 
tgaccctgcc 
ccacagcgcg 
aggacatgeg 
tctgtgccct 
agcagctgtc 
gcgtggaggc 
tgaccatcac 
getaeggcat 
gggtccccga 
ccattgggcc 
ggegggcage 
tcctgttccc 
tctgcctcct 
ttgtggagat 
aagaaaccat 



gcggctccag 
gtaeggctae 
cgaaacctac 
gtcttgcacc 
gctggttgat 
ccccgccatc 
ccgagtggtg 
agaactggee 
cgttggagtc 
ttggccggtg 
cttcttcccc 
acaagc tctg 
tgeggaggee 
gcggtccctg 
gggcatcaat 
cgctcactcc 
ctcggctgtc 
ctgcggctct 
gctgtcctac 
cagtcctgtc 
tttcatggtg 
atccatccag 
cactgegatt 
aaacegcatt 
tgatgctggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1539 



<210> 40 
<211> 3114 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 7476747CB1 



<400> 40 

ccaagcagtg 

agtgtagcac 

cgcctctgtg 

ccatagtgga 

ccccaaacat 

cagctgccat 

acgtgagccc 

accagcgctt 

aaggcaacat 

tgatgggtcc 

tggtgaaatg 

atactgtgat 

ggggaagacg 

attttgtgtt 

ataactgeca 

tgctctcctt 

ccatcttctc 

tcattgttca 

accctctctt 

tggaaaacaa 

tcgtcaccat 

tccaaggcag 

tgtactccat 

tcatcccctt 

tgcgcccagt 

geaeggaage 



cctcacttct 
caagtatacc 
tactttgeae 
gctgtcacag 
ggagctaaag 
gtccacgcag 
tgaggagagc 
tggtcaaagc 
tggcacagga 
catcagcctg 
tgctcaccac 
gtatggacta 
tgttgtggac 
tctggctgac 
caacaatgag 
cctgcccttc 
cctgttggcc 
gaggatccca 
ctttggcaca 
aatgaaggat 
cctctacatc 
cataaccctc 
egggatcttt 
ctttgtgtcc 
gctggtctgc 
cgatggcacc 



gec ttgtcta 
aatgggaagg 
ttttccattt 
tgagcaactc 
gctccagctg 
agacttegga 
ccgtcggaag 
aatagcacaa 
ctcctgggac 
ctgatcatag 
ttctgccgca 
gaatccagcc 
ttcttcctga 
aactttaaac 
aeggtgatte 
ctggtgctgc 
aacatcacca 
gaccccagcc 
gcgatttttt 
ecteggaagt 
agcctggggt 
aacctgccca 
ttcacctacg 
cgagcgcccg 
ctgacatcac 
tcctgcggca 



gctgtactct 
atgggagtca 
cccttggtac 
tgacaatgac 
caggaggtct 
atgaagacta 
gcctcaacaa 
catggttcca 
tccctctggc 
gcatcgtggc 
ggctgaataa 
cctgctcctg 
ttgtcaccca 
aggtgataga 
tgacgcctac 
tggttttcat 
tgctggtcag 
acctcccctt 
catttgaagg 
tcccactcat 
gtctggggta 
actgctggtt 
cactccagtt 
agccctgtga 
tgtctggcag 
gtgeaccatt 



ggaaaattaa 
gaagtcaagt 
caggcacttt 
agcttctacc 
taatgctggc 
ccacgactac 
cctctcctcc 
gaccttgatc 
ggtgaaaaat 
cgtgcactgc 
atcctttgtg 
gctccggaac 
gctgggattc 
ageggecaat 
catggactcg 
caggaacctc 
cttggtcatg 
ggtggcccct 
cattggaatg 
cctgtacctg 
ectgeaattt 
gtaccagtca 
ctacgtcccg 
gttagtggtg 
tgttgacaat 
ggtcttcgtc 



gaaatttatg 
gaactcagcc 
catacttaat 
ecagaggega 
cctgtccccc 
agctccacgg 
ccgggctcct 
cacctgttaa 
gcaggcatcg 
atgggtatcc 
gattatggtg 
cacgcacact 
tgctgtgtct 
gggaccacca 
cgactctaca 
cgagccctgt 
atctaccagt 
tggaagacct 
gttctgcccc 
ggcatggtca 
ggagctaata 
gttaagctgc 
gctgagatca 
gacctgtttg 
ggctggtatg 
agttcctcct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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tcctggctca cccgtggctg agtttcagat gtgagagcca gtgggtgtcc tgtcacagag 1620 
atacggtcgt cgtgtggggc ttcgccaggg gcatcttggc catcctcatc ccccgcctgg 1680 
acctggtcat ctccctggtg ggctccgtga gcagcagcgc cctggccctc atcatcccac 1740 
cgctcctgga ggtcaccacc ttctactcag agggcatgag ccccctcacc atctttaagg 1800 
acgccctgat cagcatcctg ggcttcgtgg gctttgtggt ggggacctat gaggctctct 1860 
atgagctgat ccagccaagc aatgctccca tcttcatcaa ttccacctgt gccttcatat 1920 
agggatctgg gttcgtctct gcagctgcct acccctgccc catgtgtccc ccgttacctg 1980 
tcctcagagc ctcaggtatg gtccaggctc tgaggaaagt cagggttgct gtgtgggaac 2040 
ccctctgcct ggcacctgga taccctgggc caggtaacct gagggcagtg gagaggtggg 2100 
gtggcagaca cgcagaagtg ctactagtga cagggctgcc atcgctcacc tgtacctatt 2160 
tacacccaga actttccagc tccccctcat catgcctcct ccttcctacc tgcctcccct 2220 
ctgctggtgc acctcgccca actcattctt actgcacagt tcactttatt taacaatttt 2280 
catgtccccc acctcatgtt ttcacctttt ctgggccagg catagattaa gtaactggga 2340 
acgccccctc tttataaagc tgggcttctt tctcatctct ctcccaaatg ttgtatctca 2400 
gtattcttcc tattcgagtc tccagggggt ggctggacct acctggtcat ttgaaacagg 2460 
cccccaagct ggagttttta atctggactc tctggcttgc tgtgacccct aaggcaatgc 2520 
ttctcttccc tggattcctt agtgtgggtc acagtactgt gttcttagtt gctttagctc 2580 
ttaaaacata cgaagtgttg cctaaactga aaatatttat cttttattta aaatcagatt 2 64 0 
tttgttttta gactgtctta gatctggggc tattacgaat cacttcttct tcagtaaact 27 00 
ttgactcaac ttctcctgct gaaaagaagc tcgctccaga tgtctgcatg ggtcctcggc 2760 
actcttggct gaggactcaa aggttttaat caggatcgtc taaaaatgta cctcggtgag 282 0 
gaggcacaga ttttgcctcc tgttgaccag cctggtttca taccgaaaag acattgaagg 2880 
actgcagaaa tgtatgggtg caccgggccg agggaagggt ggctgagtga gaggcgtata 2940 
aaatggggct gtgtgcatgc aggcccatgt ttcagcctca gcccacgcca ggtgaaagga 3 000 
tcagcaatgc tctgttgcca tcgtgctggg acgacaccag ctctattgcc accgatgagt 3 060 
agctgaggtc agtgtgcaca gagtttgaaa ttaagttaat agactttaca gcag 3114 

<210> 41 

<211> 2877 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 7477898CB1 

<400> 41 

atgccggtcc gcaggggcca cgtcgctccc caaaacactt acctggacac catcatccgc 60 
aagttcgagg gccaaagtcg gaagttcctg attgccaatg ctcagatgga gaactgcgcc 12 0 
atcatttact gcaacgacgg cttctgcgaa ctcttcggct actcccgagt ggaggtgatg 180 
cagcaaccct gcacctgcga cttcctcaca ggccccaaca caccaagcag cgccgtgtcc 240 
cgcctagcgc aggccctgct gggggctgag gagtgcaagg tggacatcct ctactaccgc 3 00 
aaggatgcct ccagcttccg ctgcctggta gatgtggtgc ccgtgaagaa cgaggacggg 3 60 
gctgtcatca tgttcattct caacttcgag gacctggccc agctcctggc caagtgcagc 42 0 
agccgcagct tgtcccagcg cctgttgtcc cagagcttcc tgggctccga gggctctcat 480 
ggcaggccag gcggaccagg gccaggcaca ggcaggggca agtacaggac catcagccag 540 
atcccacagt tcacgctcaa cttcgtggag ttcaacttgg agaagcaccg ctccagctcc 600 
accacggaga ttgagatcat cgcgccccat aaggtggtgg agcggacaca gaacgtcact 660 
gagaaggtca cccaggtcct gtccctgggc gcggatgtgc tgccggagta caagctgcag 720 
gcgccgcgca tccaccgctg gaccatcctg cactacagcc ccttcaaggc cgtgtgggac 780 
tggctcatcc tgctgctggt catctacacg gctgtcttca cgccctactc agccgccttc 840 
ctgctcagcg atcaggacga atcacggcgt ggggcctgca gctatacctg cagtcccctc 9 00 
actgtggtgg atctcatcgt ggacatcatg ttcgtcgtgg acatcgtcat caacttccgc 9 60 
accacctatg tcaacaccaa tgatgaggtg gtcagccacc cccgccgcat cgccgtccac 1020 
tacttcaagg gctggttcct cattgacatg gtggccgcca tccctttcga cctcctgatc 1080 
ttccgcactg gctccgatga gaccacaacc ctgattgggc tattgaagac agcgcggctg 1140 
ctgcggctgg tgcgcgtagc acggaagctg gaccgctact ctgagtatgg ggcggctgtg 120.0 
ctcttcttgc tcatgtgcac ctttccgctc atagcgcact ggctggcctg catctggtac 1260 
gccatcggca atgtggagcg gccctaccta gaacacaaga tcggctggct ggacagcctg 1320 
ggtgtgcagc ttggcaagcg ctacaacggc agcgacccag cctcgggccc ctcggtgcag 13 80 
gacaagtatg tcacagccct ctacttcacc ttcagcagcc tcaccagcgt gggcttcggc 1440 
aatgtctcgc ccaacaccaa ctccgagaag gtcttctcca tctgcgtcat gctcatcggc 1500 
tccctgatgt acgccagcat cttcgggaac gtgtccgcga tcatccagcg cctgtactcg 1560 
ggcaccgcgc gctaccacac gcagatgctg cgtgtcaagg agttcatccg cttccaccag 1620 
atccccaacc cactgcgcca gcgcctggag gagtatttcc agcacgcctg gtcctacacc 1680 
aatggcattg acatgaacgc ggtgctgaag ggcttccccg agtgcctgca ggctgacatc 1740 
tgcctgcacc tgcaccgcgc actgctgcag cactgcccag ctttcagcgg cgccggcaag 1800 
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ggctgcctgc 
ctggtgcacc 
atcctgcgcg 
gtcagcctcc 
gacctgcaca 
gcggagagct 
ctccactcat 
gacaaccagt 
gage tact gc 
tgctggcctc 
gagtcccgcg 
cagggccacg 
atagectegg 
cagaccccaa 
aggatgeetc 
cagectgagg 
ctcatctgtg 
ctggacttcc 



gcgcgctagc 
teggegaegt 
acgacgtggt 
atgcccagcc 
agate cage g 
tctggagtaa 
ccccccgaca 
cagatgeage 
aggaaatgee 
tgaagctggg 
tgtcctcaga 
ccagctacat 
agacgacgag 
gctatggaga 
acctggctgt 
ggctctggcc 
gtccctgctt 
agagacatgg 



cgtcaagttc 
gctctccacc 
cgtggccatc 
aggcaagtcc 
ggcagatctg 
gctggaggtc 
ggctcctggc 
ccctcccc tg 
cccaaagcac 
ctccaggcta 
cctcagccgc 
tetggaagee 
tccagggccc 
cttggatgac 
ggcaatggac 
acccctagcc 
ctcctccctc 
ctcagatcct 



aagaccaccc 
ctctacttca 
ctaggaaaga 
agtgcagacg 
ctggaggtgc 
accttcaacc 
agccaagacc 
agcatctcag 
agcccccaaa 
gagcagctcc 
atettgeage 
cctgcctcca 
aggctgcccc 
tgtagtccaa 
aaaactctgg 
tcacctctac 
cctgaacacc 
ggatttgcag 



acgcgccgcc 
tctcccgagg 
atgacatctt 
tgegggctet 
tggacatgta 
tgcgggacgt 
accaaggttt 
atgcattctg 
gccctcagga 
aggeccagat 
tcctccagaa 
atgacctggc 
agggctttct 
agcacaggaa 
caccatcctc 
atcccctgga 
ttggctctgt 
ggagttgggg 



tggggacacg 
ctccatcgag 
tggggaac cc 
gacctactgc 
cccggccttt 
aacegggggt 
ctttctcagt 
gctctggcct 
agacccagat 
gaacaggctg 
gcccatgccc 
cttggttcct 
gcctcctgca 
ctcctccccc 
agaacaggaa 
agtacaagga 
tcccaagcag 
ccactga 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2877 



<210> 42 

<211> 2820 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472728CB1 



<400> 42 

atggggcatc 

aggaggaagg 

ggagggtggg 

ccctttagtc 

ccccaggaag 

aacgcttgta 

accttgactc 

tatgeaggat 

catgtgatca 

gatggattta 

gcagcaccgg 

actccaactc 

acctcccaag 

gttgtggtgg 

gagagcagee 

tgtgtgagcc 

ggagtcagtc 

tttttctttg 

ggaggcaaaa 

ttggctggaa 

aaggtctttc 

ttcatcttgg 

gagggctgga 

tttggtgatt 

tggtataagc 

ctcagtatga 

gaaatcaagg 

eggegaagge 

atggagcgcc 

gagaageget 

agcatcaaca 

gggcagggtg 

aagaggaaaa 

accttccgga 

aactcagaca 

gagaaeggaa 

gaagacagaa 

tgtgtgtgtt 



aagggecatt 
aggcttggag 
gccatcgtgg 
ttttgggtcc 
gcagactcag 
caaaccatga 
tggatttctg 
taactctgea 
cactgacacc 
agggggacag 
tgtgccagcc 
cgcgcctgtc 

ggggcttgea 

tctaccttgt 
agaagaatac 
cccaggagct 
caataggaaa 
ctggaactgt 
tcttttgtat 
ttggagacca 
gaaaaaagca 
ccggctgcat 
cggccttgga 
ttgtggcagt 
ccctagtgtg 
teggagattg 
cccatgcggc 
tcagcgtgga 
ggcggctggg 
ctgtctttgc 
accggcccaa 
cgtccgagga 
acaaggacct 
attactccct 
actccagcac 
tgatacccac 
actaaatgtg 
tttaatattc 



tgaagaagga 
aagggactgt 
aagacagacc 
tcaaatcaca 
ggtaaattct 
agaactctct 
cctatgggaa 
gtcaaagttt 
tgagagagtg 
gactgaaggc 
caagagcgcc 
catttcctcc 
gaeegtcatg 
cactggcggt 
catcgccttg 
ggagacgttg 
ctcttccaac 
cattacgacc 
tttatatgee 
acttggaacc 
agtgagtcag 
tgtgtttgtg 
gtccatttac 
ggttgttttc 
gttttggatc 
getaegggtt 
agagtggaag 
gatccacgat 
cctggaccag 
tgccctggac 
caacctgcgc 
caacatcatc 
caaaaagacc 
ggacgaggag 
agecatgetg 
ggacaccaaa 
aaggacattg 
acactgagac 



aatggtggac 
cttttaggag 
tatggtaggg 
gtttatgcag 
gcctgtcttc 
ttggactgtt 
aaaaccacaa 
ctgttgggca 
acacaccatg 
tgtcgcagtg 
actaaeggge 
cgagccacag 
aagtggaaga 
cttgtcttcc 
gagaaggegg 
atccagcatg 
aacagcagcc 
atgtatggga 
atctttggaa 
atctttggga 
accaagatcc 
acgatccctg 
tttgtggtgg 
aggggaaacg 
cttgttggcc 
ctgtccaaaa 
gecaatgtea 
aagctgeage 
cgggcccact 
accggccgct 
ctgaaggggc 
aacaagttcg 
ttgeccgagg 
aagaaagagg 
acggactgta 
gaeegggage 
gtcttggact 
atgtgcctta 



tgagagtgat 
ccctgcccag 
cttgtggggt 
tttggcccca 
caccagagag 
tgctttttga 
tagtgecagg 
gagcattgtt 
tacatggctg 
attcagtggc 
aacccccggc 
tggtagccag 
cggtggttgc 
gggcattgga 
aattcctgcg 
etcttgatge 
actgggacct 
atattgetec 
ttccactctt 
aaagcattgc 
gggtcatctc 
ctgtcatctt 
tcactctgac 
ctggcatcaa 
ttgectaett 
agacaaaaga 
eggctgagtt 
gggcagccac 
cactggacat 
tcaaggcctc 
eggagcaget 
ggtccacctc 
acgttcagaa 
aggagaegga 
tccagcagca 
eggagaacaa 
gagcgttgtg 
aacagacttt 



agegacctgg 
tgtttcctgt 
gaaagaaaag 
gtcagaggga 
gggactcacc 
gaatgttaac 
ggtgcttcca 
agcaggggtc 
gtatatggag 
cgttcccgca 
tccggctccg 
gatggaaggc 
catctttgtg 
gcagcccttt 
ggatcatgtc 
tgacaatgcg 
cggcagtgcc 
gagcactgaa 
tggtttctta 
aagagtggag 
aaccatcctg 
taagtacatc 
cacggtgggc 
ttategggag 
tgcagctgtc 
agaggtgggt 
cegggagaca 
catccgcagc 
gctgtccccc 
atcccaggag 
gaacaagcat 
cagactcacc 
aatctacaag 
aaagatgtgt 
cgctgagttg 
ctcattactt 
tgtgtgtgtg 
ttagtccaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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attacatagc attgaagaat atatttcact gtgccataaa caactgaaag cttgctctgc 2340 
caaaaggaat cagagaacaa gaacttcatt tcagatagca aacgcaggac acaccaagag 24 00 
tgtccgtgca cgtagccggt tctggccgta catgttaagg gcatttcagt ggcagtgctg 24 60 
tacccctggg cagtgctacc tgggcacaca cgtagacaag ggcagctatt ccttagacca 2520 
gcctcctgaa agaaacaggt gtgtcttttt agtggagtcg tagtaatatg tgcacacaca 2580 
gaaggggacc tgattgggtg ggagctggtt atgtgtaact agcgttggag ttgacatttt 2 640 
ggcatgtgct ctgagcttga attttgatac caaccattca gtgcatcata cctagtcttt 27 00 
ctatgctcca aatgaatgtc tgtggggacc tgagagcacc tggaatttgt tggaagcaga 27 60 
tcagagcaca cgtacgaaaa ggtgcaattc cttttctcat gacaaaaggg aaaaaaataa 282 0 

<210> 43 

<211> 1440 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7474322CB1 

<220> 

<221> unsure 
<222> 757 

<223> a, t, c, g, or other 
<400> 43 

atgtacaatg agattctgat gctgggggcc aaactgcacc cgacgctgaa gctggaggag 60 
ctcaccaaca agaagggaat gacgccgctg gctctggcag ctgggaccgg gaagatcggg 12 0 
aatcgccacg acatgctctt ggtggagccg ctgaaccgac tcctgcagga caagtgggac 180 
agattcgtca agcgcatctt ctacttcaac ttcctggtct actgcctgta catgatcatc 240 
ttcaccatgg ctgcctacta caggcccgtg gatggcttgc ctccctttaa gatggaaaaa 3 00 
actggagact atttccgagt tactggagag atcctgtctg tgttaggagg agtctacttc 3 60 
tttttccgag ggattcagta tttcctgcag aggcggccgt cgatgaagac cctgtttgtg 42 0 
gacagctaca gtgagatgct tttgtttctg cagtcactgt tcatgctggc caccgtggtg 480 
ctgtacttca gccacctcaa ggagtatgtg gcttccatgg tattctccct ggccttgggc 540 
tggaccaaca tgctctacta cacccgcggt ttccagcaga tgggcatcta tgccgtcatg 600 
atagagaaga tgatcctgag agacctgtgc cgtttcatgt ttgtctacat cgtcttcttg 6 60 
ttcgggtttt ccacagcggt ggtgacgctg attgaagacg ggaagaatga ctccctgccg 720 
tctgagtcca cgtcgcacag gtggcggggg cctgctngca ggcccaatag ctcctacaac 780 
agcctgtact ccacctgcct ggagctgttc aagttcacca tcggcatggg cgacctggag 840 
ttcactgaga actatgactt caaggctgtc ttcatcatcc tgctgctggc ctatgtaatt 900 
ctcacctaca tagttctcct cctcaacatg ctcattgctc tgatgggcga gactgtggag 960 
aacgtctcca aggagagcga acgcatctgg cgcctgcaga gagccatcac catcctggac 1020 
acggagaaga gcttccttaa gtgcatgagg aaggccttcc gctcaggcaa gctgctgcag 1080 
gtggggtaca cacctgatgg caaggacgac taccggtggt gc ttcgtgga cgaggtgaac 1140 
tggaccacct ggaacaccaa cgtgggcatc atcaacgaag acccgggcaa ctgtgagggc 12 00 
gtcaagcgca ccctgagctt ctccctgcgg tcaagcagag tttcaggcag acactggaag 12 60 
aactttgccc tggtccccct tttaagagag gcaagtgctc gagataggca gtctgctcag 1320 
cccgaggaag tttatctgcg acagttttca gggtctctga agccagagga cgctgaggtt 13 80 
ttcaagagtc ctgccgcttc cggggagaag tgaggacgtc acgcagacag cactgtcaac 1440 

<210> 44 

<211> 2394 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5455621CB1 

<400> 44 

atttcagaac acatctgaat tccttctctg tggcatatgc tttaggagag gagcagacag 60 
ctcttagcta gggtcagatt tcaaattctc atctcttggt gccaatacca ccaccagatt 120 
cttctttgaa gtcaactttt gagatcttca ctaagtacac gttggtgtct gaagattcac 180 
acgagtgcct ctggtaatca ttttcttcag ggaatcacag tctctcctct cagcaaagca 240 
tccactgtac tgaactttgc ttttggaaac atcttcttcc tgagacctcg ttgaaagaaa 300 
ctctctggtg tcatactttc caatatggag gtgaagaact ttgcagtttg ggattatgtt 3 60 
gtatttgcag ccctcttttt catttcctct ggaattgggg tgttctttgc cattaaggag 42 0 
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agaaaaaagg caacttcccg agagttcctg gttgggggaa ggcaaatgag ctttggccct 480 
gtcggcttgt ctctgacagc cagcttcatg tcagctgtca cggtcctggg gaccccttct 540 
gaagtctacc gctttggggc atccttccta gtcttcttca ttgcttacct atttgtcatc 600 
ctcttaacat cagagctctt tctccctgtg ttctacagat ctggtatcac cagcacttat 660 
gagtacttac aactacgatt caacaaacca gttcgctatg ctgccacagt catctaeatt 720 
gtacagacga ttctctacac aggagtggtg gtgtatgctc ctgccctggc actcaatcaa 780 
gtgactgggt ttgatctctg gggctctgtg tttgcaacag gaattgtttg cacattctac 840 
tgtaccctgg gaggattaaa agcagtggtg tggacagatg catttcagat ggttgtcatg 900 
attgtgggct tcttaacggt tctcattcaa ggatcaactc atgctggggg attccacaat 960 
gtattagagc aatcaacaaa tggatctcga ctacatatat ttgactttga tgtagatcct 1020 
ctcaggcgac acactttttg gactatcaca gtgggaggaa cttttacttg gctcggaatc 1080 
tatggggtca atcaatcaac tattcagcga tgcatctctt gcaaaacaga aaagcatgct 1140 
aagcttgcct tgtattttaa cttgctgggt ctctggatca ttctggtgtg tgctgtcttc 1200 
tctggcttaa tcatgtactc tcactttaaa gactgtgacc cttggacttc tggcatcatc 1260 
tcagcaccag accagctgat gccgtacttt gtcatggaga tatttgccac aatgccagga 132 0 
ctgccaggac tttttgtggc ttgtgccttc agtggaactc tgagcaccgt ggcttccagc 1380 
atcaatgcct tggcaacagt gacctttgag gattttgtca agagctgttt tcctcatctc 1440 
tccgacaagc tgagcacctg gatcagtaaa ggcttatgtc tcttatttgg cgtgatgtgt 1500 
acctctatgg ctgtggctgc atctgtcatg ggaggtgttg tgcaggcttc cctcagcatt 1560 
cacggcatgt gtggaggacc aatgctgggc ttattctccc tgggaatcgt gttccctttt 1620 
gtgaactgga agggtgcact aggaggtctt cttactggaa tcaccttgtc attttgggtg 1680 
gccattgggg ccttcattta ccctgcacca gcctctaaga catggccttt gcctctatca 1740 
acagaccaat gtatcaaatc aaatgtgaca gcaacagggc ctccagtact atccagcaga 1800 
cctggaatag ctgatacctg gtactcgatc tcctaccttt actacagtgc actgggctgc 18 60 
ttaggatgca ttgttgctgg agtaatcatc agcctcataa caggtcgcca aagaggtgag 1920 
gatattcaac cactgttaat tagaccagtt tgtaatttat tttgcttttg gtctaagaag 1980 
tacaaaacac tatgctggtg tggagttcag catgacagtg ggacagagca ggaaaacctt 2 040 
gagaatggca gtgcccggaa acagggggct gaatctgtct tacagaacgg actcagaaga 2100 
gaaagcctgg tacatgttcc aggctatgat cctaaggaca aaagctacaa caatatggca 2160 
tttgagacta cccatttcta aggcaatacc tgtatgaatg cacacacaca cgtgcaatac 2220 
acacacacac acacaaactc cacatacttc ttgcctactt gttagtagat atgtatagtt 2280 
gccattgcta gaagacaggg atgtctggtg cctatttcta cttatttata actacatgca 2340 
aaatgactgt ctctcgggat attctttgaa agactccaac tttcacagag aaaa 23 94 

<210> 45 

<211> 2890 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7477248CB1 

<400> 45 

gaatactaag ccagggcaga atgcttgtga agtagcaact aaagtggcag tgtttcttct 60 

gaaattctca ggcagtcaga ctgtcttagg caaatcttga taaaatagcc cttatccagg 120 

tttttatcta aggaatccca agaagactgg ggaatggaga gacagtcaag ggttatgtca 180 

gaaaaggatg agtatcagtt tcaacatcag ggagcggtgg agctgcttgt cttcaatttt 240 

ttgctcatcc ttaccatttt gacaatctgg ttatttaaaa atcatcgatt ccgcttcttg 3 00 

catgaaactg gaggagcaat ggtgtatggc cttataatgg gactaatttt acgatatgct 3 60 

acagcaccaa ctgatattga aagtggaact gtctatgact gtgtaaaact aactttcagt 420 

ccatcaactc tgctggttaa tatcactgac caagtttatg aatataaata caaaagagaa 480 

ataagtcagc acaacatcaa tcctcatcaa ggaaatgcta tacttgaaaa gatgacattt 540 

gatccagaaa tcttcttcaa tgttttactg ccaccaatta tatttcatgc aggatatagt 600 

ctaaagaaga gacacttttt tcaaaactta ggatctattt taacgtatgc cttcttggga 660 

actgccatct cctgcatcgt catagggtta attatgtatg gttttgtgaa ggctatgata 72 0 

catgctggcc agctgaaaaa tggagacttt catttcactg actgtttatt ttttggttca 780 

ctgatgtctg ctacagatcc agtgacagtg ctggccattt tccatgaact gcacgtcgac 840 

cctgacctgt acacactctt gtttggagag agtgtgttga atgatgcagt ggccatagtc 900 

cttacatatt ctatatccat ttacagtccc aaggagaatc caaatgcatt tgatgccgca 9 60 

gcattcttcc agtctgtggg gaatttcctg ggaatcttcg ctggctcatt tgcaatgggg 1020 

tctgcgtatg ccatcatcac agcactgttg accaaattta ccaagctgtg tgagttcccg 1080 

atgctggaaa ccggcctgtt tttcctgctt tcttggagtg ccttcctgtc tgccgaggct 1140 

gccggcctaa cagggatagt tgctgttctc ttctgtggag tcacacaagc acattatacc 1200 

tacaacaatc tgtcttcgga ttccaaaata agaactaaac agttgtttga atttatgaac 12 60 

tttttggcgg agaacgtcat cttctgttac atgggcctgg cactgttcac gttccagaat 1320 

catatcttta atgctctttt tatacttgga gcctttctag caatttttgt tgccagagcc 1380 
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tgcaacatat atcccctctc cttcctcctg aatctaggcc gaaaacagaa gatcccctgg 1440 
aactttcagc acatgatgat gttttcaggt ttgcgaggag cgatcgcatt tgccttagct 15 00 
attcggaaca cagaatctca gcccaaacaa atgatgttta ccactacgct gctcctcgtg 1560 
ttcttcactg tctgggtatt tggaggagga acaaccccca tgttgacttg gcttcagatc 1620 
agagttggcg tggacctgga tgaaaatctg aaggaggacc cctcctcaca acaccaggaa 1680 
gcaaataact tggataaaaa catgacgaaa gcagagagtg ctcggctctt cagaatgtgg 1740 
tatagctttg accacaagta tctgaaacca attttaaccc actctggtcc tccgctgact 1800 
acaacattac ctgaatggtg tggtccgatt tccaggctgc ttaccagtcc tcaagcctat 1860 
ggggaacagc taaaagagga tgatgtggaa tgcattgtaa accaggatga actagccata 192 0 
aattaccagg agcaagcctc ctcaccctgc agtcctcctg caaggctagg tctggaccag 1980 
aaagcttcac cccagacgcc aggcaaggaa aacatttatg agggagacct cggccctggg 2 040 
aggctatgaa ctcaagcttg agcaaacttt gggtcaatcc cagttgaatt aattggcatg 2100 
aagagtacag atgtaatcac aagtaatgca agactcactg aggaatacaa gccaagctga 2160 
tgaggcagta caggggagag gctggaaaac atattaagag cataaattgg agagaatcaa 222 0 
agccttgtca catggatcct ctggtgcctg aagaaatgag attttattat ccctctctat 2280 
tatgcaaatg aatttagttt tttgacagca gccattctga ttactggatt ggctggggtg 2340 
gggatggagg tatcaggagt ctagctgctg gaggatggga cagctgtgct gggtcttcag 2400 
ggcatttctg ctgcgaatgc ggctctccag gcccttcact tctattctgg attttattcc 2460 
ctccattaag gagagtttaa aaataaaaga aagcttctga gagtaaacat tttgctccta 2520 
agctgaaggg aatgcccagc tatttagtaa gtgataagtt tcttattttg aggacttgac 2580 
tcccatttgc tctcagtgac cccagggcag agcccagaga agtgttccgt acccactgct 2 640 
gatggtttcc cagagcccac actgagttga agaacctatt gttcttcttg gcatccttct 27 00 
tatgctactt ctcccatcgc tcaaaggggt tgcctatggc tgggtgtgcc ctgccctaaa 27 60 
tgcagcacca ctttcaagca gcttctagct atagctttcc accaggtatt tttaatccca 2820 
tttcacctcc tcccccagca attcaccagt caggagtgat ttttactgta aagatggttg 2880 
cttagtaaaa 289 0 

<210> 46 

<211> 3926 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2944004CB1 

<400> 46 

ctggaccttt aatccactgt aggtatggac agggaagaaa ggaagaccat caatcagggt 60 

caagaagatg aaatggagat ttatggttac aatttgagtc gctggaagct tgccatagtt 12 0 

tctttaggag tgatttgctc tggtggggtt tctcctcctc ctctctattg gatgcctgag 180 

tggcgggtga aagcgacctg tgtcagagct gcaattaaag actgtgaagt agtgctgctg 240 

aggactactg atgaattcaa aatgtggttt tgtgcaaaaa ttcgcgttct ttctttggaa 3 00 

acttacccag tttcaagtcc aaaatctatg tctaataagc tttcaaatgg ccatgcagtt 3 60 

tgtttaattg agaatcccac tgaagaaaat aggcacagga tcagtaaata ttcacagact 420 

gaatcacaac agattcgtta tttcacccac catagtgtaa aatatttctg gaatgatacc 48 0 

attcacaatt ttgatttctt aaagggactg gatgaaggtg tttcttgtac gtcaatttat 540 

gaaaagcata gtgcaggact gacaaagggg atgcatgcct acagaaaact gctttatgga 600 

gtaaatgaaa ttgctgtaaa agtgccttct gtttttaagc ttctaattaa agaggttctc 660 

aacccatttt acattttcca gctgttcagt gttatactgt ggagcactga tgaatactat 72 0 

tactatgctc tagctattgt ggttatgtcc atagtatcaa tcgtaagctc actatattcc 780 

attagaaagc aatatgttat gttgcatgac atggtggcaa ctcatagtac cgtaagagtt 840 

tcagtttgta gagtaaatga agaaatagaa gaaatctttt ctaccgacct tgtgccagga 900 

gatgtcatgg tcattccatt aaatgggaca ataatgcctt gtgatgctgt gcttattaat 960 

ggtacctgca ttgtaaacga aagcatgtta acaggagaaa gtgttccagt gacaaagact 1020 

aatttgccaa atccttcagt ggatgtgaaa ggaataggag atgaattata taatccagaa 1080 

acacataaac gacatacttt gttttgtggg acaactgtta ttcagactcg tttctacact 1140 

ggagaactcg tcaaagccat agttgttaga acaggattta gtacttccaa aggacagctt 1200 

gttcgttcca tattgtatcc caaaccaact gattttaaac tctacagaga tgcctacttg 12 60 

tttctactat gtcttgtggc agttgctggc attgggttta tctacactat tattaatagc 13 2 0 

attttaaatg aggtacaagt tggggtcata attatcgagt ctcttgatat tatcacaatt 13 80 

actgtgcccc ctgcacttcc tgctgcaatg actgctggta ttgtgtatgc tcagagaaga 1440 

ctgaaaaaaa tcggtatttt ctgtatcagt cctcaaagaa taaatatttg tggacagctc 1500 

aatcttgttt gctttgacaa gactggaact ctaactgaag atggtttaga tctttggggg 1560 

attcaacgag tggaaaatgc acgatttctt tcaccagaag aaaatgtgtg caatgagatg 1620 

ttggtaaaat' cccagtttgt tgcttgtatg gctacttgtc attcacttac aaaaattgaa 1680 

ggagtgctct ctggtgatcc acttgatctg aaaatgtttg aggctattgg atggattctg 1740 

gaagaagcaa ctgaagaaga aacagcactt cataatcgaa ttatgcccac agtggttcgt 1800 
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cctcccaaac aactgcttcc tgaatctacc cc tgcaggaa accaagaaat ggagctgttt 1860 

gaacttccag ctacttatga gataggaatt gttcgccagt tcccattttc ttctgctttg 1920 

caacgtatga gtgtggttgc cagggtgctg ggggatagga aaatggacgc ctacatgaaa 1980 

ggagcgcccg aggccattgc cggtctctgt aaacctgaaa cagttcctgt cgattttcaa 2040 

aacgttttgg aagacttcac taaacagggc ttccgtgtga ttgctcttgc acacagaaaa 2100 

ttggagtcaa aactgacatg gcataaagta cagaatatta gcagagatgc aattgagaac 2160 

aacatggatt ttatgggatt aattataatg cagaacaaat taaagcaaaa aacccctgca 222 0 

gtacttgaag atttgcataa agccaacatt cgcaccgtca tggtcacagg tgacagtatg 2280 

ttgactgctg tctctgtggc cagagattgt ggaatgattc tacctcagga taaagtgatt 2340 

attgctgaag cattacctcc aaaggatggg aaagttgcca aaataaattg gcattatgca 2400 

gactccctca cgcagtgcag tcatccatca gcaattgacc cagaggctat tccggttaaa 2460 

ttggtccatg atagcttaga ggatcttcaa atgactcgfct atcattttgc aatgaatgga 2520 

aaatcattct cagtgatact ggagcatttt caagaccttg ttcctaagtt gatgttgcat 2580 

ggcaccgtgt ttgcccgtat ggcacctgat cagaagacac agttgataga agcattgcaa 2640 

aatgttgatt attttgttgg gatgtgtggt gatggcgcaa atgattgtgg tgctttgaag 2700 

agggcacacg gaggcatttc cttatcggag ctcgaagctt cagtggcatc tcccttta.cc 2760 

tctaagactc ctagtatttc ctgtgtgcca aaccttatca gggaaggccg tgctgcttta 2820 

ataacttcct tctgtgtgtt taaattcatg gcattgtaca gcattatcca gtacttcagt 2880 

gttactctgc tgtattctat cttaagtaac ctaggagact tccagtttct cttcattgat 2940 

ctggcaatca ttttggtagt ggtatttaca atgagtttaa atcctgcctg gaaagaactt 3000 

gtggcacaaa gaccaccttc gggtcttata tctggggccc ttctcttctc cgttttgtct 3 060 

cagattatca tctgcattgg atttcaatct ttgggttttt tttgggtcaa acagcaacct 312 0 

tggtatgaag tgtggcatcc aaaatcagat gcttgtaata caacaggaag cgggttttgg 3180 

aattcttcac acgtagacaa tgaaaccgaa cttgatgaac ataatataca aaattatgaa 3240 

aataccacag tgttttttat ttccagtttt cagtacctca tagtggcaat tgccttttca 3300 

aaaggaaaac ccttcaggca accttgctac aaaaattatt tttttgtttt ttctgtgatt 33 60 

tttttatata tttttatatt attcatcatg ttgtatccag ttgcctctgt tgaccaggtt 3420 

cttcagatag tgtgtgtacc atatcagtgg cgtgtaacta tgctcatcat tgttcttgtc 3480 

aatgcctttg tgtctatcac agtggagaac ttcttccttg acatggtcct ttggaaagtt 3540 

gtgttcaacc gagacaaaca aggagagtat cggttcagca ccacacagcc accgcaggag 3 600 

^tcagtggatc ggtggggaaa atgctgctta ccctgggccc tgggctgtag aaagaagaca 3 660 

ccaaaggcaa agtacatgta tctggcgcag gagctcttgg ttgatccaga atggccacca 3720 

aaacctcaga caaccacaga agctaaagct ttagttaagg agaatggatc atgtcaaatc 3780 

atcaccataa catagcagtg aatcagtctc agtggtattg ctgatagcag tattcaggaa 3840 

tatgtgattt taggagtttc tgatcctgtg tgtcagaatg gcactagttc agtttatgtc 3900 

ccttctgata tagtagctta tttgac 3 926 

<210> 47 

<211> 2135 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<223> Incyte ID No: 3046849CB1 

<400> 47 

cgctcaggcc cctctttcga atgctccacg ccctcctgcg atctagaacg attcagggca 60 
ggatcctgct cctgaccatc tgcgctgccg gcattggtgg gacttttcag tttggctata 12 0 
acctctctat catcaatgcc ccgaccttgc acattcagga attcaccaat gagacatggc 180 
aggcgcgtac tggagagcca ctgcccgatc acctagtcct gcttatgtgg tccctcatcg 240 
tgtctctgta tcccctggga ggcctctttg gagcactgct tgcaggtccc ttggccatca 3 00 
cgctgggaag gaagaagtcc ctcctggtga ataacatctt tgtggtgtca gcagcaatcc 3 60 
tgtttggatt cagccgcaaa gcaggctcct ttgagatgat catgctggga agactgctcg 420 
tgggagtcaa tgcaggtgtg agcatgaaca tccagcccat gtacctgggg gagagcgccc 480 
ctaaggagct ccgaggagct gtggccatga gctcagccat ctttacggct ctggggatcg 540 
tgatgggaca ggtggtcgga ctcagggagc tcctaggtgg ccctcaggcc tggcccctgc 600 
tgctggccag ctgcctggtg cccggggcgc tccagctcgc ctccctgcct ctgctccctg 660 
aaagcccgcg ctacctcctc attgactgtg gagacaccga ggcctgcctg gcagcactac 720 
ggcagctacg gggctccggg gacttggcag gggagctgga ggagctggag gaggagcgcg 780 
ctgcctgcca gggctgccgt gcccggcgcc catgggagct gttccagcat cgggccctga 840 
ggagacaggt gacaagcctc gtggttctgg gcagtgccat ggagctctgc gggaatgact 9 00 
cggtgtacgc ctacgcctcc tccgtgttcc ggaaggcagg agtgccggaa gcgaagatcc 9 60 
agtacgcgat catcgggact gggagctgcg agctgctcac ggcggttgtt agttgtgtgg 102 0 
taatcgagag ggtgggtcgg cgcgtgctgc tcatcggtgg gtacagcctg atgacctgct 1080 
gggggagcat cttcactgtg gccctgtgcc tgcagagctc cttcccctgg acactctacc 1140 
tggccatggc ctgcatcttt gccttcatcc tcagctttgg cattggccct gccggagtga 1200 
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cggggatcct 

gcggggcgct 
aggccttgtc 
acactggcct 
tacacagact 
tccagtcaac 
gtcctctgct 
gtgtttattg 
agcgtggtat 
aggctggttg 
atcaattgag 
caaaaaattt 
gaggctgagg 
ttcacaccac 
taataataat 
tgaggtggtg 



ggccacagag 
catgtggatc 
ccacttcctc 
gttccttcct 
caacttcccc 
agaactctag 
tcctgccagg 
agcacccttt 
tccaggctaa 
tggcagctca 
gccagagttt 
taaacattag 
cagcaggatc 
tgcatgccca 
aattacggac 
atcccgtatc 



ctgtttgacc 
atgctcatcc 
tatgtccctt 
gagaccaaag 
aggcgggccc 
tcccaaaggg 
gccctggtcc 
gtgtgcagac 
aggtaattaa 
tgactgtaat 
gaaaccagcc 
ctgggcatgg 
acttgagtcc 
gactggggtg 
tccggaaatg 
ccagccttgg 



agatggccag 
tggtcggcct 
tccttggtgt 
gcaagacctt 

agggccccac 

gtggccagag 
tcactccctc 
atggctccag 
ctgacagaaa 
cccagcactt 
taggtaacat 
tggtatgtgc 
caagagttca 
acagagggag 
cgttgacaac 
gaagc 



gcctgc tgcc 
gggatttccc 
ctgtgtctgt 
ccaagagatc 
gtggaggagc 
ccaaagccag 
ctgcat tcct 
gtgcttagca 
atcagtaaca 
tgggaggcca 
agtgagaccc 
taacagctct 
aggtagcagt 
acttcatctc 
gaaacatacc 



tgcatggtct 
tttatcatgg 
ggggccatct 
tccaaggaat 
ctggaggtta 
c tactgtcct 
catttaagga 
atcaatggtg 
acataattac 
aggtgggagg 
cctatctcta 
agctactcag 
aagctaacaa 
tttaaaaaca 
ggtggccccg 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2135 



<210> 48 

<211> 2637 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 45383 63CB1 



<400> 48 

atgggctgga 

ccctttctga 

acagccatgg 

gctgacatca 

tcttgtcggg 

tggtggccga 

ctggcgggct 

tacgtgctgc 

accttacctg 

gtcctgtccc 

ctgtttgtgc 

atcacagccc 

cagacgctca 

ggtggttacg 

aacaccacct 

ggggacctgc 

tgcaccgacc 

gcgggctcca 

ggcatgatca 

ctgcgggcct 

gaactgatgc 

tcgtcgctga 

cggctgcgtc 

gcactcatcg 

ctcttcatct 

gtcctgggcg 

gggctggtgg 

ggagagccag 

gccctctttg 

cagagtgtcc 

actaaagcag 

ttcaatgcca 

ctgccatcct 

ggggatcccg 

ccaagaactg 
gaaacataag 
gcgacaaact 
ctaccaacac 
cagaaaggca 
atgccaaaac 



gatgccactg 
cctggtttgc 
ccgccaactc 
tcgtcatcac 
ccagtaggaa 
ttggagcctc 
caggcgcggc 
tggcactggc 
agtacattca 
tgctactgtc 
acatctgcct 
tgtacaccat 
tcatggtggt 
ggcagctgga 
gccacctgcc 
cgtggaccgg 
aggtcatcgt 
tcctggccag 
gccgcgcatt 
gcggggccga 
ccatcggtct 
cctccatctt 
cccgctccgg 
gcgtgagtgt 
acatgcagtc 
tcttctggcg 
tgggggccac 
acacgcggcc 
cactcagtgg 
agattgagaa 
gtgatggcca 
tcctcctcat 
ggacagaaag 
aggccccaag 
gccaagccag 
tgtgactttt 
tagattttcc 
agctcatcac 
cccccccact 
gacacaagcc 



tccgcttggt 
ccctcagtcc 
caccagcgac 
tgtgtatttt 
cacggtgaat 
cctcttcgcc 
aggaggtctg 
atgggtgttc 
gaagcgctac 
tgtcttcacc 
gggctggaac 
cgcagggggc 

gggggctgtc 

ggcagcctac 
acgtacagac 
gatgaccttt 
gcagcgatca 
ctacctcaag 
gttcccagat 
ggtcggctgc 
gcgggggctg 
caacagcagc 
cgagcgggag 
ggcctggatc 
agtgaccagc 
acgtgccaac 
gaggctggtc 
agccgtcctg 
tgctgttgtg 
ccttacctgg 
aacaccccag 
gtgtgtcaac 
gcaggagctc 

aggggcagat 

caaagcggga 
tccaaacaac 
tgaccgggcc 
catactcaca 
ctcaagacgc 
aggacacggc 



ttaatgatca 
ctcgggctca 
ctccacactc 
gctctgaacg 
ggctacttcc 
agcagcgagg 
gccgtggcag 
gtgcccatct 

gggggccagc 

aagatatcgc 
ttctacctct 
ctggctgctg 
atcctgacaa 
gcccaggcca 
gccatgcaca 
ggcctgacca 
ctgtcagccc 
atgctcccca 
gatgtgggct 
tccaacatcg 
atgatcgcag 
agcaccctct 
ctcctgctgg 
cccgtcctgc 
tccctggccc 
gagcaggggg 
ctggaattcc 
gggagcatcc 
gtggctggaa 
tggaccctgg 
aaacacgcct 
atattctttt 
tgagtcctca 
tcccctcaca 
gcctgaaaac 
agcacccaaa 
caccacaccc 
ccacccacag 
gacggcgcaa 
actgtataca 



atgagctccc 
tacctagtgc 
ccgggacgca 
tggccgtggg 
tggcaggccg 
gctctggcct 
gcttcgagtg 
acatctcctc 
ggatccgcat 
tggacctgta 
ccaccatcct 
taatctacac 
tcaaagcttt 
ttccctccag 
tgtttcgaga 
tcatggccac 
gggac ctgaa 
tgggcctgat 
gcgtggtgcc 
cctaccccaa 
tgatgctggc 
tcactatgga 
tgggacggct 
aggactccaa 
caccagtgac 
ccttctgggg 
tgaacccagc 
actacctgca 
gcctgc tgac 
ctcaggatgt 
tctgggcccg 
atgcctactt 
ggtccaccca 
gctgcacagc 
attagggggg 
gcaagtcaag 
caacctcctc 
cggcccgccc 
tcgactgcaa 
gcacgagggt 



tgccaggaaa 
ctgcggcagg 
gctgagcgtg 
catatggtcc 
ggacatgacg 
cttcattgga 
gaatgccacg 
agagatcgtc 
gtacctgtct 
cgcgggggct 
cacgctcggc 
ggacgccctg 
tgaccagatc 
gaccattgcc 
cccccacaca 
ctggtactgg 
ccatgccaag 
catcatgccg 
gtccgagtgc 
gctggtcatg 
ggcgctcatg 
catctggagg 
ggtcatagtg 
cagcgggcaa 
tgcagtcttt 
cctgatagca 
cccaccgtgc 
cttcgctgtc 
cccaccccca 
gcccttggga 
tgtctgtggc 
cgcctgaaca 
tttccctcat 
agctcggtgc 
aaactgggac 
catttggaac 
acctcccaaa 
ccactccaat 
ctccataacg 
gatctgcaac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

114 0 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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gttgtggccg aatgcagaaa atacactggg tgctggcgta aggaagatcc gcgagtaaac 2460 
aacggtcttg taaacttact gcatccacca aggtacactt ccagaacgag accagacaac 2520 
tacactccac acaacctgca gccacaccct atttctgcta tcataaagag cccccgcacc 2580 
acataataat gccggcagac tcagtgcgcg aaacccttgt gctggacttc accacgg 2637 



<210> 49 

<211> 3783 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> incyte ID No: 6427460CB1 



<400> 49 

gcactagtac cccggagccc atgggcgcgc cgagccgggc gcgggggcgc tgaacggcgg 60 
agcgggagcg gccggaggag ccatggactg cagcctcgtg cggacgctcg tgcacagata 12 0 
ctgtgcagga gaagagaatt gggtggacag caggaccatc tacgtgggac acagggagcc 180 
acctccgggc gcagaggcct acatcccaca gagataccca gacaacagga tcgtctcgtc 240 
caagtacaca ttttggaact ttatacccaa gaatttattt gaacaattca gaagagtagc 3 00 
caacttttat ttccttatca tatttctggt gcagttgatt attgatacac ccacaagtcc 360 
agtgacaagc ggacttccac tcttctttgt cattactgtg acggctatca aacagggtta 420 
tgaagactgg cttcgacata aagcagacaa tgccatgaac cagtgtcctg ttcatttcat 480 
tcagcacggc aagctcgttc ggaaacaaag tcgaaagctg cgagttgggg acattgtcat 540 
ggttaaggag gacgagacct ttccctgcga cttgatcttc ctttccagca accggggaga 600 
tgggacgtgc cacgtcacca ccgccagctt ggatggagaa tccagccata aaacgcatta 660 
cgcggtccag gacaccaaag gcttccacac agaggaggat atcggcggac ttcacgccac 720 
catcgagtgt gagcagcccc agcccgacct ctacaagttc gtgggtcgca tcaacgttta 780 
cagtgacctg aatgaccccg tggtgaggcc cttaggatcg gaaaacctgc tgcttagagg 840 
agctacactg aagaacactg agaaaatctt tggtgtggct atttacacgg gaatggaaac 9 00 
caagatggca ttaaattatc aatcaaaatc tcagaagcga tctgccgtgg aaaaatcgat 9 60 
gaatgcgttc ctcattgtgt atctctgcat tctgatcagc aaagccctga taaacactgt 1020 
gctgaaatac gtgtggcaga gtgagccctt tcgggatgag ccgtggtata atcagaaaac 1080 
ggagtcggaa aggcagagga atctgttcct caaggcattc acggacttcc tggccttcat 1140 
ggtcctcttt aactacatca tccctgtgtc catgtacgtc acggtcgaga tgcagaagtt 1200 
cctcggctct tacttcatca cctgggacga agacatgttt gacgaggaga ctggcgaggg 1260 
gcctctggtg aacacgtcgg acctcaatga agagctggga caggtggagt acatcttcac 1320 
agacaagacc ggcaccctca cggaaaacaa catggagttc aaggagtgct gcatcgaagg 13 80 
ccatgtctac gtgccccacg tcatctgcaa cgggcaggtc ctcccagagt cgtcaggaat 1440 
cgacatgatt gactcgtccc ccagcgtcaa cgggagggag cgcgaggagc tgtttttccg 1500 
ggccctctgt ctctgccaca ccgtccaggt gaaagacgat gacagcgtag acggccccag 1560 
gaaatcgccg gacgggggga aatcctgtgt gtacatctca tcctcgcccg acgaggtggc 1620 
gctggtcgaa ggtgtccaga gacttggctt tacctaccta aggctgaagg acaattacat 1680 
ggagatatta aacagggaga accacatcga aaggtttgaa ttgctggaaa ttttgagttt 1740 
tgactcagtc agaaggagaa tgagtgtaat tgtaaaatct gctacaggag aaatttatct 1800 
gttttgcaaa ggagcagatt cttcgatatt cccccgagtg atagaaggca aagttgacca 18 60 
gatccgagcc agagtggagc gtaacgcagt ggaggggctc cgaactttgt gtgttgctta 1920 
taaaaggctg atccaagaag aatatgaagg catttgtaag ctgctgcagg ctgccaaagt 1980 
ggcccttcaa gatcgagaga aaaagttagc agaagcctat gagcaaatag agaaagatct 2 040 
tactctgctt ggtgctacag ctgttgagga ccggctgcag gagaaagctg cagacaccat 2100 
cgaggccctg cagaaggccg ggatcaaagt ctgggttctc acgggagaca agatggagac 2160 
ggccgcggcc acgtgctacg cctgcaagct cttccgcagg aacacgcagc tgctggagct 2220 
gaccaccaag aggatcgagg agcagagcct gcacgacgtc ctgttcgagc tgagcaagac 2280 
ggtcctgcgc cacagcggga gcctgaccag agacaacctc tccggacttt cagcagatat 2340 
gcaggactac ggtttaatta tcgacggagc tgcactgtct ctgataatga agcctcgaga 2400 
agacgggagt tccggcaact acagggagct cttcctggaa atctgccgga gctgcagcgc 2460 
ggtgctctgc tgccgcatgg cgcccttgca gaaggctcag attgttaaat taatcaaatt 2520 
ttcaaaagag cacccaatca cgttagcaat tggcgatggt gcaaatgatg tcagcatgat 2580 
tctggaagcg cacgtgggca taggtgtcat cggcaaggaa ggccgccagg ctgccaggaa 2 640 
cagcgactat gcaatcccaa agtttaagca tttgaagaag atgctgcttg ttcacgggca 2700 
tttttattac attaggatct ctgagctcgt gcagtacttc ttctataaga acgtctgctt 2760 
catcttccct cagtttttat accagttctt ctgtgggttt tcacaacaga ctttgtacga 2820 
caccgcgtat ctgaccctct acaacatcag cttcacctcc ctccccatcc tcctgtacag 2880 
cctcatggag cagcatgttg gcattgacgt gctcaagaga gacccgaccc tgtacaggga 2 94 0 
cgtcgccaag aatgccctgc tgcgctggcg cgtgttcatc tactggacgc tcctgggact 3 000 
gtttgacgca ctggtgttct tctttggtgc ttatttcgtg tttgaaaata caactgtgac 3 060 
aagcaacggg cagatatttg gaaactggac gtttggaacg ctggtattca ccgtgatggt 3120 
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gttcacagtt 
tgtcatctgg 
ctggccgttc 
gcccgcctgg 
gaaagtcctg 
cgcacagcct 
ctaccagagc 
cgcggcctgg 
ccactgcagt 
atggatggtc 
aggagggggg 
tgt 



acactaaagc 
gggtcgctgc 
ctcaactacc 
ctggccatcg 
tgccggcagc 
cgggaccgcg 
acctgtccct 
aaggagaagg 
tccatcccaa 
ctaagcctgt 
tcacaggcct 



ttgcattgga 
tgttctacgt 
agaggatgta 
tgctgctggt 
tgtggccaac 
actcagaatt 
cggccgcctg 
tgtccacgga 
gtcacagctg 
ggagactgtg 
tgccctcgaa 



cacacactac 
tgtcttt tea 
ctacgtgttc 
gaccatcagc 
agcaacagag 
cacccctctt 
gtacagctcc 
gcccccaccc 
ccctaggtcc 
cacgtgcctc 
catggcaccc 



tggacttgga 
cttctctggg 
atecagatge 
ctccttcccg 
agagtccagc 
gcctctctgc 
cactctcagc 
atcctcggcg 
cgtgtgggaa 
ttcctggccc 
tggccgcctg 



tcaaccattt 
gaggagtgat 
tgtccagegg 
acgtcctcaa 
agaatgggtg 
agageccagg 
aggtgacact 
gttcccatca 
tgctcgtgtg 
ccagcaggca 
gacccagcac 



3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3783 



<210> 50- 

<211> 2105 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7474127CB1 



<400> 50 

ccagcgccca 

cgtcggagaa 

gtccactcgg 

tcacagagat 

ggcacgaaac 

cctcctccga 

cgcctccact 

tegagggegg 

gcggccgcga 

accgcaccgg 

tggccttctg 

agcaccgcga 

accccggcga 

tcgggggccc 

tcttcgaaga 

tcatcctggt 

aaaacaagac 

cagatcctgc 

tagtcegtat 

ttgactttgt 

aagctgctaa 

ttttcaagct 

gtactaatga 

ccatgatcta 

acacacagtt 

tgggttatgg 

ctctggctgg 

tgtactactc 

ctcctgctcc 

atagtacaca 

cagtgttatc 

tccccatcag 

tgacgacagg 

cccgaagctt 

catcacccta 

tgtaa 



gggaagegge 
gggeggctea 
aaaccaaagc 
gggcaagatc 
ctaccgcagc 
gcccccaggc 
gtcgccgccg 
cgcgggcaac 
gttcttcttc 
caagctgcac 
gggcatcgac 
cgecgaggag 
cgacgaggac 
egaeggcaaa 
cccctactcg 
ttcaattaca 
agaaccagtc 
ettgaegtat 
tgttttttca 
ggccatccta 
agatgtgctt 
cacccgccat 
atttttgetg 
etatgecgag 
caaaaacatt 
ggatatgtac 
agtgctgaca 
cttggcaatg 
tcaggcaagc 
gagtgacaca 
aggtgacgac 
aegctctagfc 
tgattacacg 
aaacaacata 
caactctcct 



tcaaccacct 
cctgggcgcc 
agaaccactt 
gagaacaacg 
accctcaaga 
gaetgettga 
ccgagagcgc 
tgcagttccc 
gaccggcacc 
tgccccgcag 
gagaccgacg 
gcgctggaca 
ctggcggcca 
tctggccgct 
tccagagccg 
actttttgee 
atcaatggca 
gtagaaggag 
cccaataaac 
cctttctact 
ggcttcctca 
tttgtaggtc 
ctgataattt 
agagtgggag 
cccattgggt 
ccccaaacat 
atagecatge 
gcaaagcaga 
tcacctactt 
tgtctgggca 
agtacaggaa 
accagagaca 
tgtgcttctg 
gcgggcttgg 
tgtcctctga 



gaatccggaa 
aagactcagt 
ttctctcggt 
agagggtgat 
ccctgcctgg 
ccacggcggg 
ccccgctgtc 
gcggcggcag 
egggegtett 
acgtgtgcgg 
tggagccctg 
tcttcgagac 
agaggctggg 
ggaggaggct 
ccaggtttat 
tggaaacaca 
caagtgttgt 
tgtgtgtggt 
ttgaattcat 
tagaggtggg 
gggtggtaag 
tgagggtgct 
tcctggctct 
ctcaacctaa 
tctggtgggc 
ggtcaggcat 
cagtgcctgt 
aacttccaag 
tttgeaagae 
aagacaatcg 
gtgagccgcc 
aaaacagaag 
atggagggat 
caggcaatgc 
ggcgctctcg 



aacgccaaca 
cccgctgccc 
ctcgttaagt 
cctcaatgtc 
aacacgcctg 
cgacaagctg 
ccccgggcca. 
ggccagcgac 
cgcctatgtg 
gccgctcttc 
ctgctggatg 
ccccgacctc 
catcgaggac 
gcagccccgc 
tgettttget 
tgaagctttc 
tctacaatat 
gtggtttact 
caaaaatctc 
actcagtggg 
gtttgtgagg 
tggacatact 
aggagttttg 
cgacccttca 
tgtagtgacc 
gctggtggga 
cattgtcaat 
gaaaagaaag 
agaattaaat 
acttctggaa 
actatcaccc 
aggggaaaca 
caggaaaggt 
tctgaggctc 
atctcccatc 



agtagtttct 
agagaacctc 
catgtctgag 
gggggcac cc 
gcccttcttg 
cagccgtcgc 
ggeggctget 

catcccggtg 
ctcaattact 
gaggaggagc 
acctaccggc 
attggcggcg 
geggegggge 
atgtgggccc 
tctttattct 
aatattgtta 
gaaattgaaa 
tttgaatttt 
ttgaatatca 
ctgtcatcca 
atcctgagaa 
cttcgagcta 
atatttgeta 
gctagtgagc 
atgactaccc 
gccctgtgtg 
aattttggaa 
aagcacatcc 
atggcctgca 
cataacagat 
ccagaaaggc 
tgtttcctac 
tatgaaaaat 
tctccagtaa 
ccatctatct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2105 



<210> 51 

<211> 2069 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
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<223> Incyte ID No : 7476949CB1 



<400> 51 

atgagcaagg 

acagctccac 

gtggtcatct 

cgagggacca 

gcatctctga 

gctgccggag 

cttggctggg 

ctgaagaagc 

ctctacatc t 

gcattgggct 

accattgcag 

gtagggggag 

ctggagcagc 

ctcccacggc 

ccaggtctca 

attgtgcagc 

gggggctacc 

gccctgttcc 
gcccgagtgg 
ggtctgcggg 
atct tcaaca 
tcaacagagc 
agcatcctct 
caggctgtca 
tgcaagaggg 
cttctgcgta 
aggccagcag 
actgccatcg 
acacgcctca 
gagagcacac 
gcagagaact 
ttgctctgga 
gagaaggctg 
gtctgcaaca 
gcgtgattca 



acctggcagc 
acatagcact 
actttgtctt 
ttggcggcta 
tgtccagcaa 
gccttgccgt 
tcttcgtccc 
gatttggggg 
tcaccaagat 
ggaacctgta 
gtggcctcat 
ccctggtcct 
ggtacaggca 
ccgatgcttt 
ttttcgggct 
ggtctctctc 
tgaagatcct 
cagacgaggt 
gatgttccaa 
ggctgatgat 
gcagcagcac 
aggagctgat 
ggatccccat 
ccagttacct 
tcacagagcc 
tgatcctgga 
tgctgaagga 
tcattgtcat 
catggtggac 
cggagatatc 
cgagcctggg 
gctggttctg 
cgctagaaca 
tcaatgctgt 
aacctggctt 



aatggggcct 
ggactccaga 
cgtcattgct 
tttcctggcc 
tgtgggcagt 
aggtggcttc 
tgtgtacatc 
ccagaggatc 
ctcgactgac 
cctctccaca 
ggccgtgatc 
catgtttctg 
ggccatccct 
ccacattctt 
cacagtgctg 
ggccaagagt 
ccccatgttc 
gggctgcgtg 
cattgcctac 
tgccgtgatc 
cctgttcacc 
ggtggtgggc 
catccaaagc 
ggccqcaccc 
cggagctttc 
gttctcatac 
cttccactac 
tgtcagcctc 
tcggaactgc 
cgagaggcca 
ccaggagcag 

tgggctctct 

gaagctgaca 
ccttttgctg 
cactgtaga 



ggagcttcag 
gttggtctgc 
gtggggatct 
gggaggtcca 
ggcttgttca 
gagtggaacg 
gcagcaggtg 
caggtgtaca 
atcttctctg 
gggatcctgc 
tacacagatg 
ggctttcagg 
aatgtcacag 
cgggaccctg 
gccacctggt 
ctgtctcatg 
ttcatcgtca 
gaccctgatg 
cctaagttgg 
atggccgctc 
attgatgtgt 
agagtgtttg 
tccaacagtg 
atcaccgctc 
tggggcctcg 
ccagcgccag 
ctgtactttg 
tgtacaactc 
cccctctctg 
gccggggagt 
cctgaagccc 
ggaacaccgg 
agcattgagg 
gccatcaaca 



gggacggggt 
acgcctacga 
ggtcgtccat 
tgagctggtg 
tcggcctggc 
caacctggct 
tggtcacaat 
tgtctgtcct 
gagccctctt 
tggtggtgac 
ctctgcagac 
acgtgggctg 
tccccaacac 
tgagcgggga 
gttggtgcac 
ccaagggagg 
tgcctggcat 
tctgccaaag 
tcatggccct 
tcatgagctc 
ggcagcgctt 
tggtgttcct 
ggcagctctt 
tcttcctgct 
tgtttggcct 
cctgtgggga 
caatcctcct 
ccatccctga 
agctggagaa 
gccctgcagg 
caagcaggtc 
agcaggccct 
aggagccact 
tcttcctctg 



caggactgag 
catcagcgtg 
ccgtgcaagt 
gccaattgga 
tgggacaggg 
gctcctggcc 
gccgcagtat 
gtctctcatc 
catccagatg 
tgccgtctac 
ggtgatcatg 
gtacccaggc 
cacctgtcac 
catcccttgg 
agaccaggtc 
ctccgtgctg 
gatcagccgg 
aatctgtggg 
catgcctgtt 
actcacctcc 
ccgcaggaag 
ggttgtcatc 
cgactacatc 
ggccatcttc 
gggagtgggg 
ggtggaccgg 
ctgcgggctc 
ggaacagctc 
ggaggcccac 
aggtggagcg 
ctggggaaag 
gagcccagca 
ctggagacat 
gggctatttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2069 



<210> 52 

<211> 4245 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 7477249CB1 



<400> 52 
gcggcggcag 

ggggccgacg 

gtaagcggaa 
cctgcagccc 
gcgctggatc 
ctacgtagcc 
caggatcatt 
gttcagaaga 
tacacctacc 
cataaagcag 
tcctgtttat 

gggtgatatt 

ctcagatcga 
cctgaagaca 
cactctagta 
acgaatgatc 
cctgcttcgt 
tggaatggaa 



gctcagctgc 
ccgcgggatg 
cttcggcccg 
cgcggccccc 
cggcagcagc 
aacaggtttc 
tcatctaagt 
gtggcaaact 
agtccagtta 
ggatatgaag 
gttgttcgaa 
gttcgaatag 
ctggatggtt 
catgtggcag 
gctgtaatag 
ataacccaac 
ggagccagat 
actaagatgg 



gccgggcggg 
gggacgcggc 
aggggctcgc 
gcgccccgcg 
tgggttttga 
ctcagaatgg 
acactgtgtg 
tttattttct 
ccagtggact 
attggttacg 
gtggtggcct 
ccaaagatga 
cctgtcacgt 
ttccagaaac 
aatgccagca 
aaatggaaga 
taaaaaacac 
cattaaatta 



ggcggcgctg 
gcggggagtg 
ccgctcccgc 
ggacccggac 
cccaccacat 
cctttacaca 
gaattttgtt 
tattatattt 
tccattattc 
gcataac tea 
tgtaaaaact 
aatttttcct 
tacaactget 
agcattatta 
accagaagca 
aattgtaaga 
aaaagaaatt 
caagagcaaa 



gggccgcgcc 
aggcagtggc 
ctctgtcttg 
ggcgacgacg 
cagagtgaca 
cctcagaaat 
ccaaaaaatt 
ttggttcagc 
tttgtgataa 
gataatgaag 
agatcaaaaa 
gcagacttgg 
agtttggacg 
caaacagttg 
gacttataca 
cctctggggc 
tttggtgttg 
tcacagaaac 



tgtaggactc 
ggcggcggcg 
tcggcctcca 

ggggaatgtg 

caagaaccat 
ttatagataa 
tatttgaaca 
ttatgattga 
cagtaactgc 
taaatggagc 
acattegggt 
tgcttctgtc 
gagaaactaa 
ccaatttgga 
gattcatggg 
eggagagtet 
eggtatacac 
gatctgeagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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agaaaagtca atgaatacat ttttgataat ttatctagta attcttatat ctgaagctgt 1140 
catcagcact atcttgaagt atacatggca agctgaagaa aaatgggatg aaccttggta 12 00 
taaccaaaaa acagaacatc aaagaaatag cagtaaggta gagtacgtgt ttacagataa 12 60 
aactggtaca ctgacagaaa atgagatgca gtttcgggaa tgttcaatta atggcatgaa 132 0 
ataccaagaa attaatggta gacttgtacc cgaaggacca acaccagact cttcagaagg 13 80 
aaacttatct tatcttagta gtttatccca tcttaacaac ttatcccatc ttacaaccag 1440 
ttcctctttc agaaccagtc ctgaaaatga aactgaacta attaaagaac atgatctctt 1500 
ctttaaagca gtcagtctct gtcacactgt acagattagc aatgttcaaa ctgactgcac 15 6 0 
tggtgatggt ccctggcaat ccaacctggc accatcgcag ttggagtact atgcatcttc 1620 
accagatgaa aaggctctag tagaagctgc tgcaaggatt ggtattgtgt ttattggcaa 1680 
ttctgaagaa actatggagg ttaaaactct tggaaaactg gaacggtaca aactgcttca 174 0 
tattctggaa tttgattcag atcgtaggag aatgagtgta attgttcagg caccttcagg 1800 
tgagaagtta ttatttgcta aaggagctga gtcatcaatt ctccctaaat gtataggtgg 18 60 
agaaatagaa aaaaccagaa ttcatgtaga tgaatttgct ttgaaagggc taagaactct 192 0 
gtgtatagca tatagaaaat ttacatcaaa agagtatgag gaaatagata aacgcatatt 19 80 
tgaagccagg actgccttgc agcagcggga agagaaattg gcagctgttt tccagttcat 2040 
agagaaagac ctgatattac ttggagccac agcagtagaa gacagactac aagataaagt 2100 
tcgagaaact attgaagcat tgagaatggc tggtatcaaa gtatgggtac ttactgggga 2160 
taaacatgaa acagctgtta gtgtgagttt atcatgtggc cattttcata gaaccatgaa 222 0 
catccttgaa cttataaacc agaaatcaga cagcgagtgt gctgaacaat tgaggcagct 22 80 
tgccagaaga attacagagg atcatgtgat tcagcatggg ctggtagtgg atgggaccag 23 40 
cctatctctt gcactcaggg agcatgaaaa actatttatg gaagtttgca gaaattgttc 2400 
agctgtatta tgctgtcgta tggctccact gcagaaagca aaagtaataa gactaataaa 24 60 
aatatcacct gagaaaccta taacattggc tgttggtgat ggtgctaatg acgtaagcat 2520 
gatacaagaa gcccatgttg gcataggaat catgggtaaa gaaggaagac aggctgcaag 2580 
aaacagtgac tatgcaatag ccagatttaa gttcctctcc aaattgcttt ttgttcatgg 2 640 
tcatttttat tatattagaa tagctaccct tgtacagtat tttttttata agaatgtgtg 27 00 
ctttatcaca ccccagtttt tatatcagtt ctactgtttg ttttctcagc aaacattgta 27 60 
tgacagcgtg tacctgactt tatacaatat ttgttttact tccctaccta ttctgatata 282 0 
tagtcttttg gaacagcatg tagaccctca tgtgttacaa aataagccca ccctttatcg 288 0 
agacattagt aaaaaccgcc tcttaagtat taaaacattt ctttattgga ccatcctggg 2 940 
cttcagtcat gcctttattt tcttttttgg atcctattta ctaataggga aagatacatc 3 000 
tctgcttgga aatggccaga tgtttggaaa ctggacattt ggcactttgg tcttcacagt 3 060 
catggttatt acagtcacag taaagatggc tctggaaact catttttgga cttggatcaa 3120 
ccatctcgtt acctggggat ctattatatt ttattttgta ttttccttgt tttatggagg 3180 
gattctctgg ccatttttgg gctcccagaa tatgtatttt gtgtttattc agctcctgtc 3240 
aagtggttct gcttggtttg ccataatcct catggttgtt acatgtctat ttcttgatat 33 00 
cataaagaag gtctttgacc gacacctcca ccctacaagt actgaaaagg cacagcttac 33 60 
tgaaacaaat gcaggtatca agtgcttgga ctccatgtgc tgtttcccgg aaggagaagc 3420 
agcgtgtgca tctgttggaa gaatgctgga acgagttata ggaagatgta gtccaaccca 3480 
catcagcagg tgtgaaatct ctctaagtag cctttgctgc agatgagtat cctatctgga 3 540 
acaggatgaa cctgccgctc tagataccta ataaatcagc agctggtttt accaactgaa 3 600 
gcaggaagtc tgctatttat tagcactctt tggtggtaga tttcactttg tggctttggg 3 660 
gtaagggctt tttcactcac aaaggaagag aaagcacctt tgaagagact tcatctaatg 372 0 
aacaaaaaat tttgtttcat aatctttcta aaatgtgctc agtaggagtg tgtttatggt 37 80 
actcttttat ggtttgtata actttctttt ttaaattata catatactat ttccttttta 3840 
tttttttaaa atttttttgc tttttgtctt tacaaaataa tctcaacata acagtgaagt 3 9 00 
caaaggcttt ccttttctta ctctgtatgt atattttcca gttggttatt tgaggctttg 3 9 60 
aggtatttat aaacacaaaa ggctgtattt ctgctcccct acctcttctt atgtctgtaa 4020 
tgaagttttg aaatgagtca tgatttttaa gtttcttttg cttggtattt attgcctaat 4 080 
taaaagtgta tgagttagaa caggcttttt aaattatgga gtaaaagaat cttagcattt 414 0 
ttgtcccctc ctaaatctgt ttcttgaatg agatttatca ccatgcctgc tgttgtgcac 42 00 
cataacgaaa aaaaacacct tttggtaaac accatttaaa attca 4245 

<210> 53 

<211> 2124 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7477720CB1 

<400> 53 

atggctctgc agatgttcgt gacttacagt ccttggaatt gtttgctact gctagtggct 60 
cttgagtgtt ctgaagcatc ttctgatttg aatgaatctg caaattccac tgctcagtat 120 
gcatctaacg cttggtttgc tgctgccagc tcagagccag aggaagggat atctgttttt 180 
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gaactggatt 
tcccttgcaa 
tgcctcctca 
tcgcctccgg 
gagggcggc t 
tgggcagtat 
tgccaggtga 
agcctgatct 
aacgagcagc 
gtcttataca 
gtcgacattt 
ggcatcgttt 
attgagccac 
tatctctccg 
gaaaacgtgt 
gtcagcgaga 
tggaactggg 
gtatttgctc 
cagtgcatca 
cttcctctgt 
tactttactg 
aaaaaaacca 
ttaaaggctg 
tttaagaagt 
tcaagcattg 
ac tgggatac 
atcaaaagac 
taccaagttc 
gagaagcagg 
aaaggtcaca 
tacccctacg 
aagctgccca 
gtgtgggcca 



atgactatgt 
aaataggctt 
tcctggtggg 
tcatggactc 
acttcatgcc 
tgggggccct 
aggcctttgg 
ccgccgtgga 
tctacatgat 
atatgttaat 
tggctggatg 
ttggatttat 
tcatcgtctt 
gcatcctggc 
cccagacatc 
ccttgatctt 
ccttcatctg 
tcttctatat 
ttttctacag 
ctctttttcc 
tatttattca 
ataaaaaaga 
gaatcgaaga 
ttgatcatag 
tttctttgta 
tgagctctac 
tttcccctga 
ggcaaaggac 
ctaaagagat 
gcctgccctg 
ggaatcctca 
cctggctgct 
agaggcaaca 



gcaaattcct 
ccacctctac 
ggcgctggtg 
cagcatctac 
cacccggccc 
gatcaacgcc 
cctgggcgac 
cccagtggcc 
gatctttggg 
tgcctttaca 
tgcccgattc 
ttctgcattt 
catgttcagc 
aatcacagcc 
atacacgacc 
catcttcatg 
cttcaccctg 
cagtaaccag 
tggtgttcga 
taggaagaaa 
gggaatcaca 
atccatcaat 
tgtgtgtggg 
atacttacgg 
caagaagctg 
agctttctcc 
agatgtggag 
cctgtcctac 
tctgatccgc 
gggaaagccg 
gtctgcagga 
ctgctgcttt 
ttaa 



tatgaggtca 
ca caggctgc 
ggcggcatca 
ttcctgtatc 
ttctttgaga 
ttgggcattg 
gtcaacctgc 
gtgctagccg 
gaggccctgc 
aagatgcata 
atcgttgtgg 
atcacacgtt 
tatttgtctt 
tgcgcagtaa 
atcaagtact 
ggtgtgtcca 
gccttctgcc 
tttcggactt 
ggagctggaa 
atgtttgtca 
gttggccctc 
gaagagcttc 
cactggagtc 
aaaat cc tea 
gaaatgaagc 
ataccccatc 
tccataaggg 
aacaaataca 
cgccagaaca 
gctggcacca 
agagacacaa 
tctgtagagt 



ctctctggat 
caggcctcat 
tcttcggcac 
tcctgcca.ee 
acatcggctc 
gcctctccct 
tgeagaaect 
tgtttgagga 
tcaatgatgg 
aatttgaaga 
ggcttggagg 
tcactcagaa 
acttagctgc 
caatgaaaaa 
tcatgaagat 
ctgtgggcaa 
aaatctggag 
tccccttctc 
gtttttcact 
ctgctactct 
tggtcaggta 
atattegtet 
actaccaagt 
tcagaaagaa 
aagecatega 
aggeccagag 
acattctgac 
acctcaaacc 
ccttaaggga 
agaatatccg 
gggctgctgg 
caggtggtaa 



acttctagca 
gecagaaage 
cgaccacaaa 
catcgttctg 
catcctgtgg 
ctacctcatc 
gctgttcggc 
agegegegtg 
cattactgtg 
catagaaact 
ggtattgttt 
tatctctgea 
tgaaaccctc 
gtacgtggaa 
gctgagcagc 
gaatcacgag 
agccatcagc 
catcaaggac 
tgcatttttg 
agtagttata 
cctggatgtt 
gatggatcac 
gagagacaag 
cctacccaaa 
gatggtggag 
gatacaagga 
atccaacatg 
ccaaacaagt 
gagcatgagg 
ctacctctcc 
gttctcaggt 
atatctgggg 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2124 



<210> 54 

<211> 2195 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



7477852CB1 



<400> 54 

atggggggtt 

tcctttctgg 

cagaagagga 

cttaggcaac 

acggcgctgc 

gctgccccag 

ctgcacatcg 

gccagtgtct 

tactttggtg 

ctgctcattg 

cacatcctca 

tcctacgatg 

ctcaccccct 

cagaagegga 

acagagatcg 

aaacgagagg 

tggaacaagt 

atetgettta 

cattctcgag 

gaagatatca 

ctcctagaga 

cttggggggc 

gtgatgegge 



ttctacctaa 
tcagagaaca 
ttctagagtc 
ttctactgga 
acatagcagc 
agctggtctt 
ctgttgtgaa 
ctgccagagc 
agcacccttt 
agcatggagc 
tcctccagcc 
gacatgggga 
tcaagctggc 
ggcacatcca 
actcctgggg 
ctcgccaaat 
atggccggcc 
ccacgtgctg 
acatcaccat 
tcaggctggt 
ttccagacat 
cattccatgt 
tcaccaacac 



ggcagaaggg 
agactgggac 
tccactgctt 
ctgcacctgt 
cctctatgac 
tgagcccacc 
ccagaatgtg 
cacaggcact 
gtcctttget 
tgacatcagg 
caacaaaacc 
ccacctgcag 
tggagtggag 
gtggacgtat 
agaggagctg 
tctggaacag 
gtacttctgc 
cgtctaccgc 
cctccagcaa 
gggggagctg 
cttcagggtt 
catcatgatc 
caatggggag 



cccgggagcc 
cagcacctgg 
cgagcatcca 
gaegttcgae 
aacttggagg 
acatgtgagg 
aacctggtgc 
gccttccgcc 
gcctgtgtga 
gcccaggact 
tttgcctgcc 
cccctggacc 
ggtaacactg 
ggacccctga 
tccttcctgg 
accccagtga 
atcctggctg 
ccccttaagt 
aaactactac 
gtgagcatcg 
ggtgcctctc 
acctatgcct 
gtggtgccca 



aactccagaa 
acaagcttca 
aggaaaatga 
aaagaggagc 
cggccttggt 
ettttgeagg 
gtgccctgct 
gtagtccccg 
acagegagga 
ccctgggtaa 
agatgtacaa 
ttgtgcccaa 
tgatgttcca 
cctccattct 
agcttgtggt 
aggagctggt 
ccttgtacct 
ttcgtggtgg 
aggaggecta 
ttggggctgt 
gctattttgg 
ccctggtgct 
tgtcctttgc 



acttctgccc 
tatgetgeag 
cctgtctgtt 
cc tgggggag 
gctgatggag 
teagactgea 
cacccgcagg 
caacctcatc 
gategtgegg 
cacagtatta 
cctgctgctg 
tcaccagggt 
gcacctgatg 
ctacgacctc 
ctcctctgat 
gagcttcaag 
gctctacatg 
caaccgcact 
tgagacacgt 
gatcatcctg 
aaagacgatt 
ggtgaccatg 
cctggtgctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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ggctggtgca gtgtcatgta tttcactcga ggattccaga tgctgggtcc cttcaccatc 1440 
atgatccaga agatgatttt tggagaccta atgcgtttct gctggctgat ggctgtggtc 1500 
atcttgggat ttgcctccgc gttctatatc attttccaga cagaggaccc aaccagtctg 1560 
gggcaattct atgactaccc catggcactg ttcaccacct ttgagctttt tctcactgtt 1620 
attgatgcac ctgccaacta cgacgtggac ttgcccttca tgttcagcat tgtcaacttc 1680 
gccttcgcca tcattgccac actgctcatg ctcaacttgt tcatcgccat gatgggcgac 1740 
acccactgga gggtggccca ggagagggat gagctctgga gggcccaggt cgtggccacc 18 00 
acagtgatgc tggagcggaa gctgcctcgc tgcctgtggc ctcgctccgg gatctgtggg 1860 
tgcgaattcg ggctggggga ccgctggttc ctgcgggttg agaaccacaa tgatcagaat 1920 
cctctgcgag tgcttcgcta tgtggaagtg ttcaagaact cagacaagga ggatgaccag 1980 
gagcatccat ctgagaaaca gccctctggg gctgagagtg ggactctagc cagagcctct 2040 
ttggctcttc caacttcctc cctgtcccgg accgcgtccc agagcagcag tcaccgaggc 2100 
tgggagatcc ttcgtcaaaa caccctgggg cacttgaatc ttggactgaa ccttagtgag 2160 
ggggatggag aggaggtcta ccatttttga ttaac 2195 

<210> 55 

<211> 2055 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1471717CB1 

<400> 55 

cggctctggg ccctcagcct ggctcatgca caactgtctg aagtgctctg gactatggtg 60 
atgaacagcg gccttcagac gcgaggctgg ggaggaatcg tcggggtttt tattattttt 120 
gccgtatttg ctgtcctgac agtagccatc cttctgatca tggagggcct ctctgctttc 180 
ctgcacgccc tgcgactgca ctgggaagct gtttggggaa cttgagctat ttagaagatg 240 
gcaaccaagc caacagagcc tgtcacgatc ctcagccttc ggaaattgag cctggggacc 3 00 
gcagagccac aggttaaaga gccaaagacg ttcaccgtgg aagatgcagt ggagactatc 3 60 
ggcttcgggc gtttccacat tgccctcttt ctgatcatgg gcagtactgg ggtggttgag 420 
gccatggaga tcatgttgat agctgttgtg tctcctgtca tccgctgtga atggcaactg 480 
gagaattggc aggtggcatt agtaaccacg atggtgtttt ttggctacat ggttttcagt 540 
atcctctttg gcctcctggc tgacagatat ggccgctgga agattctgct catctcgttc 600 
ctgtggggag cctatttctc cttgctgacc tcgtttgctc cttcgtacat ctggtttgtc 660 
ttcctgcgga cgatggtggg ctgtggtgtg tccggccact cgcaagggtt aatcataaag 720 
actgaatttt tgcccacgaa ataccgaggc tatatgttac ccttgtctca ggtgttctgg 780 
cttgcgggct ccctgctcat cattggcttg gcctctgtga tcatccccac catcgggtgg 840 
cgctggctca ttcgcgtcgc ctccatcccg ggcatcatcc tcatcgtggc cttcaagttt 900 
attcctgaat ctgcccggtt caatgtctcc actgggaaca ctcgggctgc cctggccact 960 
ctggagcgcg ttgccaagat gaaccgctcg gtcatgccgg aggggaagct ggtggagccc 102 0 
gtcctggaaa aaagaggaag atttgcagac ctattggatg ctaaatattt acggaccaca 1080 
ttacagatct gggtcatatg gcttggaatc tcttttgcct actatggggt tatcctggcc 1140 
agtgctgagc tgctggagcg ggacttggtc tgtggttcaa agtcagactc tgcggtggtg 12 00 
gtgactgggg gggactcagg ggagagccag agcccctgct actgccacat gtttgcaccc 1260 
tctgactatc ggaccatgat catcagcacc atcggtgaaa ttgctttgaa tcctttaaat 1320 
atactgggca tcaatttcct gggaagacgg ctgagccttt ctattaccat gggatgcacg 1380 
gctttattct gccttctcct caacatttgc acttcaagtg ccggcctgat tggcttcctc 1440 
ttcatgctga gggctctggt agctgcaaac ttcaacaccg tctacattta cacagctgag 1500 
gtctacccca ccacgatgcg cgctttgggg atgggaacca gcggctccct gtgtcgcatt 1560 
ggtgcaatgg tggcgccatt tatatcccag gttcttatga gtgcatcaat actgggggcc 162 0 
ctgtgtctct tctcatctgt ctgtgttgta tgcgccattt ctgcattcac tctccccatc 1680 
gaaaccaaag gacgggccct ccagcaaatt aaatgaagac ctgcaaagct atgtctacca 1740 
gatgagaaaa atgaattcta tcttcagaac tgcggtgcat ttttttaaaa cttggtttta 1800 
cttctgtatg ctactcggta attagtaaag tgattttttt ttaaaaggca tatatgggaa 1860 
tggggtaggt aactgtatat tgatctcttc cttgaggaac aatatataaa gtacttttat 1920 
aaaatataat ttaagctttc aaaggggtgt gagagggaga tggtgggggg gaagatggct 1980 
tttcttcgtt gaaatcaagt ctgtaaacct ttatatgaat aaatactaaa ttttaaactt 2040 
acaaaaaaaa aaaaa 2 055 

<210> 56 

<211> 4727 

<212> DNA 

<213> Homo sapiens 

<220> 
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<221> misc_feature 

<223> Incyte ID No: 3874406CB1 

<400> 56 

aagagctgct ggagtaggca cccatttaaa gaaaaaatga agaagcagca ataaagaagt 60 
tgtaatcgtt acctagacaa acagagaact ggttttgaca gtgtttctag agtgcttttt 120 
attattttcc tgacagttgt gttccaccat gattactttc tccttcagcg aataggctaa 180 
atgaatatga aacagaaaag cgtgtatcag caaaccaaag cacttctgtg caagaatttt 240 
cttaagaaat ggaggatgaa aagagagagc ttattggaat ggggcctctc aatacttcta 3 00 
ggactgtgta ttgctctgtt ttccagttcc atgagaaatg tccagtttcc tggaatggct 3 60 
cctcagaatc tgggaagggt agataaattt aatagctctt ctttaatggt tgtgtataca 42 0 
ccaatatcta atttaaccca gcagataatg aataaaacag cacttgctcc tcttttgaaa 480 
ggaacaagtg tcattggggc accaaataaa acacacatgg acgaaatact tctggaaaat 540 
ttaccatatg ctatgggaat catctttaat gaaactttct cttataagtt aatatttttc 600 
cagggatata acagtccact ttggaaagaa gatttctcag ctcattgctg ggatggatat 660 
ggtgagtttt catgtacatt gaccaaatac tggaatagag gatttgtggc tttacaaaca 720 
gctattaata ctgccattat agaagtagct ttggtgttcc tgatgagtgt gctgttaaag 780 
aaagctgtcc tcaccaattt ggttgtgttt ctccttaccc tcttttgggg atgtctggga 840 
ttcactgtat tttatgaaca acttccttca tctctggagt ggattttgaa tatttgtagc 900 
ccttttgcct ttactactgg aatgattcag attatcaaac tggattataa cttgaatggt 9 60 
gtaatttttc ctgacccttc aggagactca tatacaatga tagcaacttt ttctatgttg 1020 
cttttggatg gtctcatcta cttgctattg gcattatact ttgacaaaat tttaccctat 1080 
ggagatgagc gccattattc tcctttattt ttcttgaatt catcatcttg tttccaacac 1140 
caaaggacta atgctaaggt tattgagaaa gaaatcgatg ctgagcatcc ctctgatgat 12 00 
tattttgaac cagtagctcc tgaattccaa ggaaaagaag ccatcagaat cagaaatgtt 126 0 
aagaaggaat ataaaggaaa atctggaaaa gtggaagcat tgaaaggctt gctctttgac 13 2 0 
atatatgaag gtcaaatcac ggcaatcctg ggtcacagtg gagctggcaa atcttcactg 1380 
ctaaatattc ttaatggatt gtctgttcca acagaaggat cagttaccat ctataataaa 1440 
aatctctctg aaatgcaaga cttggaggaa atcagaaaga taactggcgt ctgtcctcaa 1500 
ttcaatgttc aatttgacat actcaccgtg aaggaaaacc tcagcctgtt tgctaaaata 156 0 
aaagggattc atctaaagga agtggaacaa gaggtacaac gaatattatt ggaattggac 162 0 
atgcaaaaca ttcaagataa ccttgctaaa catttaagtg aaggacagaa aagaaagctg 1680 
acttttggga ttaccatttt aggagatcct caaattttgc tcttagatga accaactact 1740 
ggattggatc ccttttccag agatcaagtg tggagcctcc tgagagagcg tagagcagat 1800 
catgtgatcc ttttcagtac ccagtccatg gatgaggctg acatcctggc tgatagaaaa 1860 
gtgatcatgt ccaatgggag actgaagtgt gcaggttctt ctatcttttt gaaaagaagg 1920 
tggggtcttg gatatcacct aagtttacat aggaatgaaa tatgtaaccc agaacaaata 19 80 
acatccttca ttactcatca catccccgat gctaaattaa aaacagaaaa caaagaaaag 2 04 0- 
cttgtatata ctttgccact ggaaaggaca aatacatttc cagatctttt cagtgatctg 2100 
gataagtgtt ctgaccaggg agtgacaggt tatgacattt ccatgtcaac tctaaatgaa 2160 
gtctttatga aactggaagg acagtcaact atcgaacaag atttcgaaca agtggagatg 2220 
ataagagact cagaaagcct caatgaaatg gagctggctc actcttcctt ctctgaaatg 2280 
cagacagctg tgagtgacat gggcctctgg agaatgcaag tctttgccat ggcacggctc 2340 
cgtttcttaa agttaaaacg tcaaactaaa gtgttattga ccctattatt ggtatttgga 24 00 
atcgcaatat tccctttgat tgttgaaaat ataatatatg ctatgttaaa tgaaaagatc 2460 
gattgggaat ttaaaaacga attgtatttt ctctctcctg gacaacttcc ccaggaaccc 2520 
cgtaccagcc tgttgatcat caataacaca gaatcaaata ttgaagattt tataaaatca 2580 
ctgaagcatc aaaatatact tttggaagta gatgactttg aaaacagaaa tggtactgat 2 640 
ggcctctcat acaatggagc tatcatagtt tctggtaaac aaaaggatta tagattttca 27 00 
gttgtgtgta ataccaagag attgcactgt tttccaattc ttatgaatat tatcagcaat 27 60 
gggctacttc aaatgtttaa tcacacacaa catattcgaa ttgagtcaag cccatttcct 2820 
cttagccaca taggactctg gactgggttg ccggatggtt cctttttctt atttttggtt 2880 
ctatgtagca tttctcctta tatcaccatg ggcagcatca gtgattacaa gaaaaatgct 2 940 
aagtcccagc tatggatttc aggcctctac acttctgctt actggtgtgg gcaggcacta 3 000 
gtggacgtca gcttcttcat tttaattctc cttttaatgt atttaatttt ctacatagaa 3 060 
aacatgcagt accttcttat tacaagccaa attgtgtttg ctttggttat agttactcct 3120 
ggttatgcag cttctcttgt cttcttcata tatatgatat catttatttt tcgcaaaagg 3180 
agaaaaaaca gtggcctttg gtcattttac ttcttttttg cctccaccat catgttttcc 3240 
atcactttaa tcaatcattt tgacctaagt atattgatta ccaccatggt attggttcct 33 00 
tcatatacct tgcttggatt taaaactttt ttggaagtga gagaccagga gcactacaga 33 60 
gaatttccag aggcaaattt tgaattgagt gccactgatt ttctagtctg cttcataccc 3420 
tactttcaga ctttgctatt cgtttttgtt ctaagatgca tggaactaaa atgtggaaag 3480 
aaaagaatgc gaaaagatcc tgttttcaga atttcccccc aaagtagaga tgctaagcca 3 540 
aatccagaag aacccataga tgaagatgaa gatattcaaa cagaaagaat aagaacagtc 3 600 
actgctctga ccacttcaat cttagatgag aaacctgtta taattgccag ctgtctacac 3 660 
aaagaatatg caggccagaa gaaaagttgc ttttcaaaga ggaagaagaa aatagcagca 3720 
agaaatatct ctttctgtgt tcaagaaggt gagattttgg gattgctagg acccagtggt 3780 
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gctggaaaaa 
gtggaactga 
gtgctgtggc 
ctcagggaag 
gagcagctga 
gtgctgagcc 
gaccccacag 
agaggtgtcc 
gccatcatgg 
cttggcaagg 
cacactgaga 
ttaacctata 
gaagcagtga 
aaggtattct 
acaacaatga 
aattttcttg 



gttcatctat 
aaggctgcag 
ccatgctgac 
cggacgcgag 
atgttcctgt 
tcctgggaaa 
ggcagcagca 
tcctgaccac 
tgtctggaag 
attacattct 
ttctgaagct 
agctgcccgt 
agcataactt 
tagagctttc 
gatggaaact 
cttgatctcc 



tagaatgata 
ttcagttttg 
gttgagggaa 
gctcgccatc 
gcagaaatta 
ctcacctgtc 
aatgtggcag 
ccataacctg 
gcttagatgc 
agagctaaaa 
tttcccacag 
ggcagacgtt 
taacctggaa 
taaagaacag 
cctccctcat 
tataaactta 



tctgggatca 
ggccacctgg 
cacctggagg 
gcaagattag 
acagcaggaa 
ttgctcctgg 
gcaatccagg 
gc tgaggcgg 
attggctcca 
gtgaaggaaa 
gctgcagggc 
taccctctat 
gaatacagcc 
gaagtaggaa 
tcagatgaac 
tgttttatgt 



caaagccaac 
ggtactgccc 
tgtatgctgc 
tgagtgcttt 
tcacgagaaa 
atgaaccatc 
cagtcgttaa 
aagccttgtg 
tccaacacct 
cgtctcaagt 
agcaaaggta 
cacagacctt 
tttctcagtg 
attttgatga 
cttaaaacct 
aataatt 



tgctggagag 
tcaagagaac 
cgtcaagggg 
caaactgcat 
gttgtgtttt 
tacgggcata 
aaacacagag 
tgaccgtgtg 
gaaaaacaaa 
gactttggtc 
ttcctctttg 
tcacaaatta 
cacactggag 
agaaattgat 
caaacctagt 



<210> 57 

<211> 3852 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4599654CB1 



<400> 57 

cgccggcgat 

gctctggagg 

agcggggtgg 

acccctggac 

ccgatcccca 

acggtcaaca 

gagcgggtga 

tgggacctga 

accttcttca 

ttcttcctac 

gaga tec tgc 

gacctcatct 

ttggacgctg 

ctaagcctgc 

gagatcttcc 

gggatgatgc 

caggacttcc 

cgecagtatt 

cagcaggcac 

gccacatgct 

tcccggcgtc 

ctgccagcag 

atgttcgatg 

aacttcacct 

ttcgtcactg 

gtgcgtgagg 

ctggcccgcg 

ctgetaacta 

tcactcagcg 

tttgagactg 

eggaageget 

caacatgaca 

cttagtggaa 

gtgacctcca 

gactctccag 

ccgctggtgc 

gcccgaaccc 

ccccctccag 

caaccctctc 

agtgagcggc 



tccgagccta 
ggttgcgggt 
gcgccatgga 
tggaggcggt 
aatctgggcc 
agttctccct 
agtcagcggg 
teatgetget 
aggaggagaa 
tggatctggt 
tggcaccgcg 
cttctatccc 
aggtctacaa 
tgaggctget 
acatgaccta 
tgctgctatg 
ctcccgactg 
cccatgccct 
ctgtaggcat 
acgccatgtt 
agtaccagga 
acacgcggca 
aggaaagcat 
gteggggect 
cagttctcac 
gctccgtggg 
gcgcccggga 

ggggccggcg 

tggaccattt 
tggccatgga 
ccgagccaag 
gagacatggc 
agecagtact 
atgtggccat 
ccaccctcct 
ccgtccgagc 
tgcacgccag 
gaggaggtgg 
tgcctcagcg 
tgcctccctc 



cgacgcctcc 
acctgatggc 
ggcagagcag 
gcctcccgtt 
tgagcctaag 
tcgggtgttc 
ggcctggatc 
gctgatggtg 
ctccccgcct 
gctcaacttc 
ggccatccgc 
tgtggattac 
aaeggcaegg 
ccgcctctcc 
tgacctggcc 
tcactgggat 
ctgggtctcc 
gttcaaggee 
gcccgacgtc 
catcggccat 
gaagtacaag 
gcgcatccac 
ectgggegag 
ggtggcccac 
caagctgcgc 
gaggaagatg 
cacacgcctc 
cacagccagt 
caatgctgtg 
tcggctgctc 
tccaggcagc 
tcggggtgtt 
gtgggagcca 
tgccctgact 
tgctcgctct 
tggcccatgg 
cctatcccgg 
aeggeggcta 
ggcaacaggc 
agggctcctg 



gctagagccc 
cacagagggc 
cggccggcgg 
gctcccccgc 
aggaggcacc 
ggcagccaca 
atccacccct 
gggaacctca 
tggategtet 
egaaegggea 
acgcgctacc 
atcttcctag 
gccctacgca 
cgcctcatcc 
agtgctgtgg 
ggctgtctgc 
atcaaccaca 
atgagecaca 
tggctcacca 
gccacggcac 
caggtggagc 
gagtactatg 
ctgagegage 
atgccgctgt 
tttgaggtct 
tacttcatcc 
accgatggat 
gttcgggctg 
cttgaggagt 
cgcatcggca 
agtggtggca 
eggggteggg 
ctggtacatg 
catcageggg 
gettggeget 
gcatccacct 
geagggeget 
ggacctcggg 
gatggctctc 
gccaaacctc 



gcggggctgc 
tctaggaggc 

egggggecag 

ctgcgaccgc 
ttgggacgct 
aagcagtgga 
acagegaett 
tcgtcctgcc 
teaaegtatt 
tcgtggtgga 
tgcgcacctg 
tggtggagct 
tegttcgett 
gctacataca 
ttegcatett 
agttcctggt 
tggtgaacca 
tgctgtgcat 
tgctcagcat 
tcatccagtc 
agtacatgtc 
agcaccgcta 
cgcttcgcga 
ttgcccatgc 
tccagccggg 
agcatgggct 
cctactttgg 
acacctactg 
tccccatgat 
agaagaattc 
tcatggagca 
ccccgagcac 
cgccccttca 
gccctctgcc 
cagcaggctc 
cccgcctgcc 
cccaggtctc 
gccgcccact 
ctgggcgtaa 
caaggacagc 



3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4727 



gccgactcct 
cgagcgtgta 
egaaggggeg 
ggcctcaggt 
gctccagcct 
aatcgagcag 
ccggttttac 
tgtgggcatc 
gtctgatact 
ggagggtgct 
gttcctggtt 
ggagecaegg 
caccaagatc 
ccagtgggag 
caacctcatt 
gcccatgctg 
ctcgtggggc 
tggctatggg 
gategtaggt 
cctggactct 
cttccacaag 
ccagggcaag 
ggagatcatt 
cgaccccagc 
ggatctcgtg 
gctcagtgtg 
ggagatctgc 
ccgcctttac 
gcgccgggcc 
catactgeag 
gcatttggtg 
aggagc tcag 
ggcagctget 
cctctcccct 
tccagcttcc 
cgccccacct 
cctgctgggt 
ctcagcctcc 
gggatcagga 
ccagcccccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 
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aggccaccag tgcctgagcc agccacaccc 
aacctttgag tacatccagc cttagttctt 
tgcctcttgg ggaaggccat ggggacctga 
gcggacattc cgcatactgc catgaagacg 
agcttgtccc atcataatcc attcacccgt 
caaggtaata tgccaggcgc tgtatgtctc 
ctgacaagga tattcccttg gctatggtcc 
acctttacta agcacaagta cttgccactg 
tttccctgcc aatgatcctg caggttctgc 
catagccagg cactgccagt cacctgtgcc 
tgagtgtggg tatccacttt tacccgctca 
tctgagtggg aacatacttc taagttccct 
ttgtaaactg cgagcttcct cttccctgta 
ttaaggggta gttggcacct ccttattaat 
gcaaatggct gaatccttgt gtgatatttt 
taattgtatt tattcattta ctaactttat 
ctttatccat ccctcccctc cttttcaggt 
gcctctccat gccagcctct gtggtccttg 
tctcccaggt aggggcagga ggagccacat 
agagttttgt tttgtttttt ccttctgagt 
ccaaggtatc caagcctggg gaagggcagg 
agagtagtcc tttaccaccc tcactctgcc 
actgtgggac caggacccta agtctccttt 
ggtgtgtggg agaagtttta taattgcttc 
acactcaaaa aa 



cggggtctcc agctttctgc caacatgtaa 2460 
ggggtgcagt agtatgtacc caagggcaga 252 0 
aacattgccc catggaaatg tcgaccctgt 2580 
gtctctgtgt cctcagctca agaatcctgt 2640 
tcatcatgtg tactgagcag ctaccatgtt 2700 
cactgccaag tagaagtgac tcaaaaccct 27 60 
tgccaggtgc aggcccaggc ccatgacccc 2820 
ccatcactgc caagtaacta gatgtctctg 2880 
ccggtctggt tatcttcctg tttcctgtag 2940 
cccattgctg tcagcagatg tcttgggtcc 3 000 
ctgccacctg tggacactct gtgtctaccc 3 060 
gcagtctctg tcctgtggta gaccatcttt 3120 
ccctctgccc cagtcgtgac cccctaaaag 3180 
atgccagcct agatcccccc cggtggaggg 3 240 
tttcttcgct tgtttattta ttcatttatt 3300 
gtgttaccaa ttaattttgt ttacccattc 3360 
aaggagacag gaggagtagg aggaggcagg 342 0 
cccaaaccca tcagcgcaat acttgaacct 3480 
gagagaggga gaaggaccgc gtttaccttt 3 54 0 
ttgctgttgg tgcaggaata agggaaaggc 3 600 
ccagccagca cctctgcctt ctcagggaca 3 660 
tgtcccctct cctactctac agcattaaag 3720 
ccttctgggt ggggagttct aggggttctt 3780 
caaacagctg ggtttaaata taaaatagac 3 840 

3852 



<210> 58 
<211> 1917 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_feature 

<223> Incyte ID No: 5047435CB1 



<400> 58 

atggcagaag gtgaaagggg agcagacgtg 
gtggcgttgg cggcgctgcg cgcgggaggg 
ccggaaagtt tgtccggcgg cagcggcgtt 
ccctcagcgc ccccagcacg ccgccgagtc 
ctgcctttgt gtgcctctgt gtctttgctg 
gtcatatcag gtgccctgct gccactgcag 
gagttcctgg tgggcagcct gctcctgggg 
ctcattgact gctatggcag gaagcaagcc 
ggcagcctga ccc tgggcct ggctggttcc 
gttggcttcg ccatttccct ctcctccatg 
gggccacggc agcggggagt gctggtgtcc 
ctgctctcct atgccctcaa ctatgcactg 
ttcggctggg ccactgcacc tgctgtcctg 
ggtacagatg agactgcaac acacaaggac 
aagctgggcc cggggaggcc acggtactcc 
atgcgaggcc ggaccacagt gggcctgggg 
cccaacgtgc tgtgctatgc ctccaccatc 
tcagccgtgc tggcctctgt ggggcttggc 
atggggctgg tggaccgtgc aggccgcagg 
gccctgtccg tcagtggcat aggcctcgtc 
agctgtctgg ctgtgcccaa tgccaccggg 
ctgcaggact cctctctacc tcccattcca 
ttgtccactg ctaagaaaac caagccccat 
cggctggccc tgagctctgc cctccctggg 
ctgcgctgga ccgcactgct gtgcctgatg 
gggccagtga cctggcttgt cctcagcgag 
ttcgccttct gcaacagctt caactgggcg 
gatctcattg gcaccatcgg cttgtcctgg 
ctcggcctgg gcttcatcta tttatttgtt 
atagaccagc agttccagaa gagacggttc 
actggcatcc cgtacagccg catcgagatc 



ccacatggcc tcggggcctg gctggccgac 60 
cagggcagga gggacagagg cgggggcggg 120 
ggggactccg gcgggggatg cgcgcccggc 180 
ccgctcgcca tgggccactc cccacctgtc 240 
ggtggcctga cctttggtta tgaactggca 300 
cttgactttg ggctaagctg cttggagcag 3 60 
gctctcctcg cctccctggt tggtggcttc 420 
atcctcggga gcaacttggt gctgctggca 480 
ctggcctggc tggtcctggg ccgcgctgtg 540 
gcttgctgta tctacgtgtc agagctggtg 600 
ctctatgagg caggcatcac cgtgggcatc 660 
gctggtaccc cctggggatg gaggcacatg 72 0 
caatccctca gcctcctctt cctccctgct 780 
ctcatcccac tccagggagg tgaggccccc 840 
tttctggacc tcttcagggc acgcgataac 900 
ctggtgctct tccagcaact aacagggcag 960 
ttcagctccg ttggtttcca tgggggatcc 1020 
gcagtgaagg tggcagctac cctgaccgcc 1080 
gctctgttgc tagctggctg tgccctcatg 1140 
agctttgccg tgcccatgga ctcaggccca 1200 
cagacaggcc tccctggaga ctctggcctg 1260 
aggaccaatg aggaccaaag ggagccaatc 1320 
cccagatctg gagacccctc agcccctcct 13 80 
ccccctctgc ccgctcgggg gcatgcactg 1440 
gtctttgtca gtgccttctc ctttgggttt 1500 
atctaccctg tggagatacg aggaagagcc 1560 
gccaacctct tcatcagcct ctccttcctc 1620 
accttcctgc tctacggact gaccgctgtc 1680 
cctgaaacaa aaggccagtc gttggcagag 1740 
accctgagct ttggccacag gcagaactcc 1800 
tctgcggcct cctgaggaat ccgtctgcct 1860 
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ggaaattctg gaactgtggc tttggcagac catctccagc atcctgcttc ctaggcc 



1917 



<210> 59 
<211> 6791 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 
<223> Incyte ID No: 



7475603CB1 



<400> 59 

cgcgctccct 

gagcttccag 

tgcccgcagc 

ccctgcccag 

tcatgtatcg 

tcttcttcat 

acttcccaaa 

gtaatgtgaa 

gcaacttcaa 

gggccagtgc 

cacgcagcac 

atgtcgcgga 

aagcccagga 

tggcgctgcg 

gccccctgga 

gcccctccct 

aatccgccct 

acagccaccc 

tactctttgc 

tcgaggagct 

tcttcacctt 

aggatgaagg 

cc tttctgga 

acctggtggg 

caccctcaga 

gggccggcgt 

cagacctggg 

ggaccaataa 

acctgcgcta 

tccgcgtgct 

cgtgctatgt 

cgctggcctg 

cgcggctgcg 

ggttcctcag 

agctggggga 

tcgcggtggc 

tggctgcggc 

tggcttggcg 

tggccttcgg 

agtggcacaa 

gccttctgct 

gcccaggcca 

gcggacctcg 

tggtagaaga 

gctttcctgg 

acatcaccgc 

gtggcctctt 

gcatggccgc 

tgctgaccgt 

tagtgggccc 

tgcagactcg 

tgggcggctc 

gccgcggtat 

cccaccacct 

gcttgtgctg 



gcctgctgct 
gaaccctgcg 
cgcaccgcac 
tctcaccatg 
ccggagacag 
cctggtggct 
caagccactg 
caacacctgc 
cgactccctg 
ccacaggacg 
ggcccagcct 
gctgctgacg 
gcccttgcac 
cagcctggtg 
gttgctgtca 
caactggtac 
gccagacagc 
gctgtcccgc 
accagataca 
caccctgctg 
catgaacgac 
aagaaggcag 
ccctgggagc 
cacgctgggc 
ggcagccctg 
cgtcttcttg 
ccccggccac 
gatcagggac 
cgtgtggggc 
cagcggcgcc 
ggacgacgtg 
gatctactcc 
ggacaccatg 
ctgcctcggg 
catcctcccc 
cacggtgacc 
ctgcggcggc 
ggaccggctg 
cttcggctgc 
cgtgggcacc 
gctggacgcg 
gtacgggatc 
gccccccaag 
ggcaccgccc 
aagcccgcag 
cttcctgggc 
cccacccagt 
catccggccc 
ggacgagcac 
cgagcaggac 
ccacctctct 
ccaagttgtt 
ttgggagctg 
ggatgaggca 
ctgtggatcc 



gggcggaggg 
ctgtgggata 
gtcttcagcc 
gccttctgga 
ccggtccagc 
gttcgccact 
ccatcggcgg 
tttccgcagc 
gtctcccggc 
ctggctggcc 
caaccaacca 
tcactgctgc 
agcttgttgg 
gagcttcggg 
gaggccctct 
gaggctagtg 
agcctgagcc 
ctgctctgga 
ccttttaccc 
agggatgtcc 
agttccaatg 
cccagacctg 

ggtggctaca 

cgagtgacgg 
gtgtcgcggg 
ggacctgagg 
gtgcgcatca 
aggttttggg 
ggcttcgtgt 
aacccccggg 
ttcctgcgtg 
gtgacactga 
cgcgccatgg 
cccttcctgc 
tacagccacc 
cagagcttcc 
ctggcctact 
cccgcgggtg 
gagagcctgg 
cggcctacgg 
gcgctctacg 
cctgaaccat 
agtccagccc 
ggcctgagtc 
ccagccctgc 
cacaacgggg 
ggtggctctg 
cacctgggcg 
gtctggttct 
cgtctgctgc 
ggtgggatgc 
atcctggacg 
ctgctcaaat 
gagctgctgg 
ccactcttcc 



aaggcggcaa 
aaggaatgag 
cgaccgttgt 
cacagctgat 
tcctggtcga 
cccacccgcc 
gcaccgtgcc 
tgacaccggg 
tgctagccga 
tagggaagct 
agcagtctcc 
gcacggaatc 
aggccgctga 
cactgctgca 
gcagtgtcag 
acctgatgga 
ccgcctgctc 
gacgcctgaa 
ggaagctcat 
gggaggtgtg 

tggccatgct 
gaggccggga 
gc tggcagga 
agtgcctgtc 
ccctgcaact 
actcttcaga 
aaatccgcat 
accctggccc 
acctgcaaga 
ccggcctcta 
tgctgagccg 
cagtgaaggc 
ggctcagccg 
tcagcgccgc 
cgggcgtggt 
tgctcagcgc 
tctccctcta 
gccgcgtggc 
ctctgctgga 
cagacgtctt 
gcctcgccac 
ggaattttcc 
cttgccccac 
ctggcgtctc 
gggggctcag 
ccggcaagac 
ccttcatcct 
tctgtcctca 
atgggcggct 
aggatgtggg 
aacggaagct 
agcctacggc 
accgagaagg 
gagaccgtgt 
tgcgccgtca 



gagctgcgga 
gttcagaaag 
cctgacctct 
gctgctgctc 
attgctgtgg 
cctggagcac 
ctggctccag 
cgaggagccc 
tgcccgcact 
gatcgccacg 
actggaacca 
cctggggttg 
ggacctggcc 
gagaccccga 
gggacctagc 
gctggtgggg 
ggagctgatt 
gcctctgatc 
ggcccaggtg 
ggagatgctg 
gcagcggctc 
ccacatggag 
cgcacacgct 
cttggacaag 
gctcgcggaa 
ccccacagag 
ggacattgac 
agccgcggac 
cc tggtggag 
cctgcagcag 
gtcgctgccg 
cgtggtgcgg 
cgcggtgctc 
gctgctggtt 
cttcctgttc 
cttcttctcc 
cctgccctac 
cgcgagcctg 
ggagcagggc 
cagcctggcc 
ctggtacctg 
ttttcggagg 
cccgctggac 
cgttcgcagc 
cctggacttc 
caccaccctg 
gggccacgac 
gtacaacgtg 
gaagggtctg 
gctggtctcc 
gtccgtggcc 
tggcgtggat 
tcgcacgctg 
ggccgtggtg 
cctgggctcc 



gcccctggaa 
gggcaggagt 
ctgtcccgtc 
tggaagaatt 
cctctcttcc 
catgaatgcc 
ggtctcatct 
gggcgcctga 
gtgctgggag 
ctgagggctg 
cccatgctgg 
gcactgggcc 
caggagctcc 
gggaccagcg 
agcacagtgg 
caggagccag 
ggagccctgg 
ctcgggaagc 
aaccggacct 
ggaccccgga 
ctgcagatgc 
gccctgcgat 
gatgtggggc 
ctggaggcgg 
catcgattct 
cacccaaccc 
gtggtcacga 
cccctgaccg 
cgtgcagccg 
atgccctatc 
ctcttcctga 
gagaaggaga 
tggctaggct 
ctggtgctca 
ttggcagcct 
cgcgccaacc 
gtgctgtgtg 
ctgtcgcccg 
gagggcgcgc 
caggtctctg 
gaagctgtgt 
agctactggt 
ccaaaggtgc 
ctggagaagc 
taccagggcc 
tccatcttga 
gtccgctcca 
ctgtttgaca 
agtgccgctg 
aagcagagtg 
attgcctttg 
cctgcttccc 
atcctctcca 
gcaggtggcc 
ggctactacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 - 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 
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tgacgctggt gaaggcccgc ctgcccctga ccaccaatga gaaggctgac actgacatgg 33 60 
agggcagtgt ggacaccagg caggaaaaga agaatggcag ccagggcagc agagtcggca 3 420 
ctcctcagct gctggccctg gtacagcact gggtgcccgg ggcacggctg gtggaggagc 3480 
tgccacacga gctggtgctg gtgctgccct acacgggtgc ccatgacggc agcttcgcca 3 540 
cactcttccg agagctagac acgcggctgg cggagctgag gctcactggc tacgggatct 3 600 
ccgacaccag cctcgaggag atcttcctga aggtggtgga ggagtgtgct gcggacacag 3 660 
atatggagga tggcagctgc gggcagcacc tatgcacagg cattgctggc ctagacgtaa 3720 
ccctacggct caagatgccg ccacaggaga cagcgctgga gaacggggaa ccagctgggt 3780 
cagccccaga gactgaccag ggctctgggc cagacgccgt gggccgggta cagggctggg 3840 
cactgacccg ccagcagctc caggccctgc ttctcaagcg ctttctgctt gcccgccgca 3 900 
gccgccgcgg cctgttcgcc cagatcgtgc tgcctgccct ctttgtgggc ctggccctcg 3 9 60 
tgttcagcct catcgtgcct cctttcgggc actacccggc tctgcggctc agtcccacca 4020 
tgtacggtgc tcaggtgtcc ttcttcagtg aggacgcccc aggggaccct ggacgtgccc 4080 
ggctgctcga ggcgctgctg caggaggcag gactggagga gcccccagtg cagcatagct 4140 
cccacaggtt ctcggcacca gaagttcctg ctgaagtggc caaggtcttg gccagtggca 4200 
actggacccc agagtctcca tccccagcct gccagtgtag ccggcccggt gcccggcgcc 42 60 
tgctgcccga ctgcccggct gcagctggtg gtccccctcc gccccaggca gtgaccggct 4320 
ctggggaagt ggttcagaac cagacaggcc ggaacctgtc tgacttcctg gtcaagacct 438 0 
acccgcgcct ggtgcgccag ggcctgaaga ctaagaagtg ggtgaatgag gtcagatacg 444 0 
gaggcttctc gctggggggc cgagacccag gcctgccctc gggccaagag ttgggccgct 4500 
cagtggagga gttgtgggcg ctgctgagtc ccctgcctgg cggggccctc gaccgtgtcc 4560 
tgaaaaacct cacagcctgg gctcacagcc tggatgctca ggacagtctc aagatctggt 462 0 
tcaacaacaa aggctggcac tccatggtgg cctttgtcaa ccgagccagc aacgcaatcc 4680 
tccgtgctca cctgccccca ggcccggccc gccacgccca cagcatcacc acactcaacc 4740 
accccttgaa cctcaccaag gagcagctgt ctgaggctgc actgatggcc tcctcggtgg 4800 
acgtcctcgt ctccatctgt gtggtctttg ccatgtcctt tgtcccggcc agcttcactc 4860 
ttgtcctcat tgaggagcga gtcacccgag ccaagcacct gcagctcatg gggggcctgt 492 0 
cccccaccct ctactggctt ggcaactttc tctgggacat gtgtaactac ttggtgccag 4980 
catgcatcgt ggtgctcatc tttctggcct tccagcagag ggcatatgtg gcccctgcca 5040 
acctgcctgc tctcctgctg ttgctactac tgtatggctg gtcgatcaca ccgctcatgt 5100 
acccagcctc cttcttcttc tccgtgccca gcacagccta tgtggtgctc acctgcataa 5160 
acctctttat tggcatcaat ggaagcatgg ccacctttgt gcttgagctc ttctctgatc 5220 
agaagctgca ggaggtgagc cggatcttga aacaggtctt ccttatcttc ccccacttct 5280 
gcttgggccg ggggctcatt gacatggtgc ggaaccaggc catggctgat gcctttgagc 53 40 
gcttgggaga caggcagttc cagtcacccc tgcgctggga ggtggtcggc aagaacctct 54 0 0 
tggccatggt gatacagggg cccctcttcc ttctcttcac actactgctg cagcaccgaa 5460 
gccaactcct gccacagccc agggtgaggt ctctgccact cctgggagag gaggacgagg 5520 
atgtagcccg tgaacgggag cgggtggtcc aaggagccac ccagggggat gtgttggtgc 5580 
tgaggaactt gaccaaggta taccgtgggc agaggatgcc agctgttgac cgcttgtgcc 5640 
tggggattcc ccctggtgag tgttttgggc tgctgggtgt gaatggagca gggaagacgt 57 00 
ccacgtttcg catggtgacg ggggacacat tggccagcag gggcgaggct gtgctggcag 57 60 
gccacagcgt ggcccgggaa cccagtgctg cgcacctcag catgggatac tgccctcaat 582 0 
ccgatgccat ctttgagctg ctgacgggcc gcgagcacct ggagctgctt gcgcgcctgc 588 0 
gcggtgtccc ggaggcccag gttgcccaga ccgctggctc gggcctggcg cgtctgggac 5940 
tctcatggta cgcagaccgg cctgcaggca cc tacagcgg agggaacaaa cgcaagctgg 60 00 
cgacggccct ggcgctggtt ggggacccag ccgtggtgtt tctggacgag ccgaccacag 6060 
gcatggaccc cagcgcgcgg cgcttccttt ggaacagcct tttggccgtg gtgcgggagg 6120 
gccgttcagt gatgctcacc tcccatagca tggaggagtg tgaagcgctc tgctcgcgcc 6180 
tagccatcat ggtgaatggg cggttccgct gcctgggcag cccgcaacat ctcaagggca 6240 
gattcgcggc gggtcacaca ctgaccctgc gggtgcccgc cgcaaggtcc cagccggcag 63 00 
cggccttcgt ggcggccgag ttccctgggg cggagctgcg cgaggcacat ggaggccgcc 63 60 
tgcgcttcca gctgccgccg ggagggcgct gcgccctggc gcgcgtcttt ggagagctgg 6420 
cggtgcacgg cgcagagcac ggcgtggagg acttttccgt gagccagacg atgctggagg 6480 
aggtattctt gtacttctcc aaggaccagg ggaaggacga ggacaccgaa gagcagaagg 6540 
aggcaggagt gggagtggac cccgcgccag gcctgcagca ccccaaacgc gtcagccagt 6600 
tcctcgatga ccctagcact gccgagactg tgctctgagc ctccctcccc tgcggggccg 66 60 
cggggaggcc ctgggaatgg caagggcaag gtagagtgcc taggagccct ggactcaggc 6720 
tggcagaggg gctggtgccc tggagaaaat aaagagaagg ctggagagaa gccgtggtgg 67 80 
tgaaaaaaaa a 67 91 

<210> 60 

<211> 5214 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
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<223> Incyte ID No: 7477845CB1 



<400> 60 

atgctcaaaa 

gatgagtctc 

ttgc tgcgca 

ttcgagcact 

ctctacacgg 

tcctatgtga 

tctttggtgc 

atgttgcgga 

gaactgccaa 

gtttccattt 

ggaacattta 

agtttagcta 

cctggattta 

agtggcttta 

ggctgggtgt 

tatttcatca 

atcattgaaa 

agcactacct 

ctggtagctg 

atgcggtcat 

gcggctagca 

gcggaggtgg 

ggat ttactg 

actactcttc 

cgagtagttc 

ggtcctggaa 

tcagcaatta 

ccgagggcat 

gaccaaactc 

ctctatcatc 

aacttagaac 

gcggacacca 

cctcaaatgg 

agttttatga 

ctcccgacca 

aaatacatcg 

gaaaaggaga 

ggatcatttg 

cgccaagagc 

tctcttgaaa 

gaagtcagtg 

aagtgcaaaa 

gggcgagaac 

tctaaaacag 

ttgctgggat 

atttccatga 

gctgagtatg 

ggcttatttt 

atatatcttg 

ggagctcagc 

ccccagatga 

tccattcttt' 

ggaaaactgg 

ttcagaatta 

aaacctggat 

gtgggaaacg 

agagatgtta 

ttcctgggtt 

gaaaacaagg 

cgactgaaga 

gctaaaatgt 

ctggcccagt 

ttggcaacaa 

atagcaatgt 



ggaagcagag 
tgtcggataa 
tctgtgccat 
atcctccact 
cagagatgat 
aagatcgctg 
tacaggtgtt 
ttccacggcc 
ggaccagaat 
ttctactttt 
cttatcactg 
ttccagacac 
aatgcatgga 
atgagatagg 
tcctcatgta 
ctctcatttt 
catttgcaga 
caacagccac 
tggatgtcaa 
ccgttttcca 
actactacaa 
cttttacagt 
gatatattag 
atgtataccc 
ggctgattaa 
aaaagcttgg 
gtttgcagat 
ttatgtccat 
taaatgctgt 
tttttgccac 
ttgatgaaga 
aagaaaagc t 
tgaaaatctc 
agcagtttat 
cctcttcctc 
accaaaaact 
ccgcagtcac 
aggggcagcc 
gcaggtcact 
ctttgactca 
aaataaagat 
aggaactcag 
acaggttcag 
accctgtcac 
tggtcactta 
tgtttgagtc 
tgtttgtgat 
tcactccaac 
tgagcttgat 
ttctaatggt 
ggaaagttgt 
tgctgacatt 
ccaagtgcaa 
atgtcagtgt 
tttgggtgcc 
ctatgctggc 
ttattcatcg 
gcatgattgg 
ggacggcttt 
tcgcacagcc 
atgacataac 
cggtgttgct 
tgtcagttgt 
cgcctgctgg 



ttccagggtg 
tgctgacatc 
catcagcgtc 
tcagtatgtg 
agcaaaaatg 
gtgtgttttt 
tgaaattgct 
actgattatg 
tacaaatatt 
ctttctactt 
tgttgtaaat 
acactgctca 
cc ttgaagat 
aactagtata 
cagagcaatt 
cttcctcgcc 
aatcagagta 
cacccagatg 
caagccccag 
catgttcatc 
aggagaaaac 
actttttgat 
ctcatctctc 
agatctttat 
gatttcacct 
gagtttggtt 
gttctgcttt 
gttccagatc 
gggacatatg 
tctgatcctc 
cctaaagaag 
ccctttacgc 
aaagcttcct 
tgaccgccag 
ctcctgcgac 
tcgcaagtct 
taaaatctta 
cgcaaaggag 
aagacatgga 
agatcattgc 
gattcaggaa 
agagagccac 
aaact tttgc 
aggagctgtg 
cctggactgg 
cccgtttcga 
attcatgagc 
tgctgtcatc 
atttctttgt 
ccttcggtgc 
tcgagaactt 
aatgctcgtt 
tgatcccaac 
gtcaaagaac 
ccgtgtttgg 
gttgtttgaa 
tgtggggccg 
actgaccctt 
gctgaccgtc 
tcttcatctt 
ccagcatcca 
ctctgtcaag 
tttcaccttc 
cttctggcaa 



gaagcccagc 
ctctggatta 
atttctgttt 
accttcactt 
cacatccggg 
gatggattta 
gatatagttg 
atccgagcat 
ttaaagcgat 
ctttatggaa 
gacacaaagc 
ccagagctag 
ctgggactta 
ttcaccgtct 
gacagctttc 
tggcttgtga 
cagtttcaac 
tttcatgaag 
ggacgcgccc 
ctgagcatgg 
ttcaggaggc 
ttggaagcac 
cacaaat teg 
cattcacaat 
gcattagaag 
gtatttactg 
gtcgaagaac 
ctcacccagg 
tgggcacccg 
ctgagtttgt 
cttaaacaat 
ctgegaatet 
tcagatttta 
caacaggaca 
cactccaaac 
gttttcagca 
agggcttgea 
aggtcaatcc 
tcaaacagcc 
aatacagtga 
aaaaaggagc 
ccatacttcg 
cgggtggtgg 
aaaaatacaa 
gtcatgatca 
agagtcatgc 
attgagctta 
agggacttcg 
tggatgeetc 
ctgagacctc 
ttcagegget 
tttgeaaget 
attattagaa 
ttaaatttaa 
gcgaatcctc 
gttctctcct 
atccatggaa 
tttgttggag 
gatcagagaa 
ccgcctcgcc 
ttttttaaga 
tgggaegteg 
atctttgttc 
agcagaagaa 



cagtcactga 
acaaaccatg 
gtatgaatac 
tggatacatt 
gcattgtcaa 
tggtcttttg 
atcagatgtc 
teeggattta 
egggagaaca 
ttttaggagt 
cagggaatgt 
aagaaggcta 
gcaggcaaga 
atgaggcege 
cccgttggcg 
agaacgtgtt 
aaatgtgggg 
atgctgctgg 
cagcctgcct 
tgaccgtgga 
agtacgacga 
ttctgaagat 
aactac tact 
teaegtaett 
actttgtgta 
ccagcctctt 
tggacagatt 
aaggatgggt 
tggttgccat 
ttgttgctgt 
taaagcaaag 
ttgaaaaatt 
cagttcctaa 
catgttgcct 
gctcagcaat 
tcagggcaag 
cccgacagcg 
teagegtgea 
agaggatcag 
tatatagaaa 
tagcagagat 
ataagecact 
tccgagcacg 
agtaccatct 
tcgtaacctc 
atgcacctac 
atctgaagat 
gtggagtaat 
aaaatgtacc 
tgegcatatt 
tcaaggaaat 
ttggagttca 
gggaagattg 
aattgaggee 
ggaactttaa 
tgaaaggctg 
tctatattca 
tagttattgc 
gatgggaaga 
eggataatga 
ggacaatege 
aggacceggt 
tggaggtaac 
accgatacga 



ctttggtcct 
ggttcactct 
gccaatgacc 
attgatgttt 
gggggatagt 
cctttgggtt 
accttggggc 
tttccgattt 
aatatggagt 
tcagatgttt 
aacctggaat 
ccagtgccca 
gctgggctac 
ctcacaggaa 
ttcctacttc 
tattgctgtt 
atcgagaagc 
aggttggcag 
ccagaaaatg 
cgtgatcgtg 
gttctacctg 
atggtgtttg 
cgtaat tgga 
tcaggtactc 
caagatattt 
gattgttatg 
tactacgttt 
ggacgtaatg 
ctatttcatt 
tattttggac 
tgaagcaaat 
tccaaacaga 
aatcagggag 
tctgagaagc 
tgaggacaac 
gaaccttctg 
cat get gage 
gcatcatatc 
caggggaaaa 
tgctcaaaga 
gcttcaagga 
gttcattgtc 
cttcaacgcg 
tctttatgat 
tgactcttgc 
tttgeagatt 
tatggcagat 
ggacatattt 
tgetgaateg 
caaactggtg 
ttttttggtc 
getttttget 
caatggcata 
tggagagaaa 
tttcgacaat 
ggtggaagtg 
tgtttttgta 
taatttcaat 
cctgaagagc 
tggttttaga 
attactegtc 
gaccgtacct 
catgaagatc 
tctcctggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

13 80 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2-580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 
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acgtcgcttg gcgttgtatg ggtggtgctt cactttgccc tcctgaatgc atatacttac 3900 

atgatgggcg cttgtgtgat tgtatttagg tttttctcca tctgtggaaa acatgtaacg 3 960 

ctaaagatgc tcctcttgac agtggtcgtc agcatgtaca agagcttctt tatcatagta 4020 

ggcatgtttc tcttgctgct gtgttacgct tttgctggag ttgttttatt tggtactgtg 4080 

aaatatgggg agaatattaa caggcatgca aatttttctt cggctggaaa agctattacc 4140 

gtactgttcc gaattgtcac aggtgaagac tggaacaaga ttatgcatga ctgtatggta 4200 

cagcctccgt tttgtactcc agatgaattt acatactggg caacagactg tggaaattat 4260 

gctggggcac ttatgtattt ctgttcattfc tatgtcatca ttgcctacat catgctaaat 4320 

ctgcttgtag ccataattgt ggagaatttc tccttgattt attccactga ggaggaccag 43 80 

cttttaagtt acaatgatct tcgccacttt caaataatat ggaacatggt ggatgataaa 4440 

agagaggtat tccccacgtt ccgcgtcaag ttcctgctgc ggctactgcg tgggaggctg 4500 

gaggtggacc tggacaagga caagctcctg tttaagcaca tgtgctacga aatggagagg 4560 

ctccacaatg gcggcgacgt caccttccat gatgtcctga gcatgctttc ataccggtcc 4620 

gtggacatcc ggaagagctt gcagctggag gaactcctgg cgagggagca gctggagtac 4680 

accatagagg aggaggtggc caagcagacc atccgcatgt ggctcaagaa gtgcctgaag 4740 

cgcatcagag ctaaacagca gcagtcgtgc agtatcatcc acagcctgag agagagtcag 4800 

cagcaagagc tgagccggtt tctgaacccg cccagcatcg agaccaccca gcccagtgag 4860 

gacacgaatg ccaacagtca ggacaacagc atgcaacctg agacaagcag ccagcagcag 492 0 

ctcctgagcc ccacgctgtc ggatcgagga ggaagtcggc aagatgcagc cgacgcaggg 4980 

aaaccccaga ggaaatttgg gcagtggcgt ctgccctcag ccccaaaacc aataagccat 5040 

tcagtgtcct cagtcaactt acggtttgga ggaaggacaa ccatgaaatc tgtcgtgtgc 5100 

aaaatgaacc ccatgactga cgcggcttcc tgcggttctg aagttaagaa gtggtggacc 5160 

cggcagctga ctgtggagag cgacgaaagt ggggatgacc ttctggatat ttag 5214 

<210> 61 

<211> 1818 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 168827CB1 

<400> 61 

ggaaattgct tccgtgaccc tgctgcagat gggagagagg gcccattaag aagagagtgg 60 
ggtcaggatc aacacacaca cttagtgtga tttaaggaaa ggaaatattt tctctttgaa 120 
cttatctgga tacagtcatt ttgtctcctc ttggggatca cttgtccagc ctcaatggcc 180 
tttcaggacc tcctagatca agttggaggc ctggggagat tccagatcct tcagatggtt 240 
ttccttataa tgttcaacgt catagtatac catcaaactc agctggagaa cttcgcagca 3 00. 
ttcatacttg atcatcgctg ctgggttcat atactggaca atgacactat ccctgacaat 3 60 
gaccctggga ccctcagcca ggatgccctc ctgagaatct ccatcccatt cgactcaaat 420' 
ctgaggccag agaagtgtcg tcgctttgtc catccccagt ggaagctcat tcatctgaat 48 0 
gggaccttcc ccaacacgag tgagccagat acagagccct gtgtggatgg ctgggtatat 540 
gaccaaagct ccttcccttc caccattgtg actaagtggg atctggtatg cgaatctcaa 600 
ccactgaatt cagtagctaa atttctattc atggctggaa tgatggtggg aggcaaccta 660 
tatggccatt tgtcagacag gtttgggaga aagttcgtgc tcagatggtc ttacctccag 72 0 
ctcgccattg taggcacctg tgcggccttt gctcccacca tcctcgtata ctgctccctg 780 
cgcttcttgg ctggggctgc tacatttagc atcattgtaa atactgtttt gttaattgta 840 
gagtggataa ctcaccaatt ctgtgccatg gcattgacat tgacactttg tgctgctagt 900 
attggacata taaccctggg aagcctggct tttgtcattc gagaccagtg catcctccag 960 
ttggtgatgt ctgcaccatg ctttgtcttc tttctgttct caaggtggct ggcagagtct 1020 
gctcggtggc tcattatcaa caacaaacca gaagagggct taaaggaact tacaaaagct 1080 
gcacacagga atggaatgaa gaatgctgaa gacatcctaa ccatggaggt tttgaaatcc 1140 
accatgaagc aagaactgga ggcagcacag aaaaagcatt ctctttgtga attgctccgc 1200 
atacccaaca tatgtaaaag aatctgtttc ctgtcctttg tgagatttgc aagtaccatc 1260 
cctttttggg gccttacttt gcacctccag catctgggaa acaatgtttt cctgttgcag 132 0 
actctctttg gtgcagtcac cctcctggcc aattgtgttg caccttgggc actgaatcac 1380 
atgagccgtc gactaagcca gatgcttctc atgttcctac tggcaacctg ccttctggcc 1440 
atcatatttg tgcctcaaga aatgcagacc ctgcgtgtgg ttttggcaac cctgggtgtg 150 0 
ggagctgctt ctcttggcat tacctgttct actgcccaag aaaatgaact aattccttcc 1560 
ataatcaggg gaagagctac tggaatcact ggaaactttg ctaatattgg gggagccctg 162 0 
gcttccctca tgatgatcct aagcatatat tctcgacccc tgccctggat catctatgga 1680 
gtctttgcca tcctctctgg ccttgttgtc ctcctccttc ctgaaaccag gaaccagcct 1740 
cttcttgaca gcatccagga tgtggaaaat gagggagtaa atagcctagc tgcccctcag 1800 
aggagctctg tgctatag 1818 

<210> 62 
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WO 02/04520 
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<211> 2245 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 7472734CB1 



<400> 62 

cccttggaac 

ttcctttaac 

aagccctgct 

aatatcccat 

ggactgcatc 

ttcacagagt 

taattatatc 

gtccctccac 

aaagcagtgt 

ctctgaactt 

aatgggcttt 

gatagctttc 

cactgcattc 

tgacaatgat 

ctcaaacctg 

cc tgaacggg 

ggtgtacgac 

atctcagtca 

tctaatatat 

cctccagctc 

ctccctgcgc 

aattgtagag 

tgctagtatt 

cctccagttg 

agagtctgct 

aaaagctgca 

gaaatccacc 

gctccgcata 

taccatccc t 

gttgcagact 

gaatcacatg 

tctggccatc 

gggtgtggga 

tccttccata 

agccctggct 

ctatggagtc 

ccagcctctt 

ccctcagagg 



agtaggatgt 
taagaatagt 
ctctgatctt 
ttctgagaag 
cacctgaaaa 
agtcttttaa 
tgtgacttct 
tgctcttctg 
caggatccac 
ctcttcataa 
gatgtgctcc 
ttttgcatca 
acccctagtc 
accgggaccc 
aggccacaga 
accttcccca 
agaagctctt 
ctaaaatcaa 
ggccatcttt 
gccattgtag 
ttcttggctg 
tggataactc 
ggacatataa 
gtgatgtctg 
cggtggctca 
cacaggaatg 
atgaagcaag 
cccaacatat 
ttttggggcc 
ctctttggtg 
agccgtcgac 
atatttgtgc 
gctgcttctc 
atcaggggaa 
tccctcatga 
tttgccatcc 
cttgacagca 
agctctgtgc 



tggtgatgca 
ttttattaac 
tgtacaagaa 
aatggcagac 
tccaccgttg 
gaagattcag 
gaaactattt 
cagtagtgtc 
atggcactat 
agtcattgtg 
tggatcaagt 
ccaacatcct 
atcgctgctg 
tcagcaagga 
agtgtcagcg 
acacaaatga 
tcctctccac 
tggttcaatc 
cagacaggtt 
gcacctgtgc 
gggctgctac 
accaattctg 
ccctgggaag 
caccatgctt 
ttatcaacaa 
gaatgaagaa 
aac tggaggc 
gtaaaagaat 
ttactttgca 
cagtcaccct 
taagccagat 
ctcaagaaat 
ttggcattac 
gagctactgg 
tgatcctaag 
tctctggcct 
tccaggatgt 
tatag 



aaagtcaatg 
ttttagtaaa 
catcataaag 
tattgaacag 
acttatcagg 
ttgteaacag 
ggctgcctaa 
acaccactca 
ggtaactttg 
cttcctcttg 
gggtggcatg 
actgttccct 
ggtccccctc 
tgacctcctg 
ctttatccat 
gccagacacg 
catcgtgact 
cctatttatg 
tgggagaaag 
ggcctttgct 
atttagcatc 
tgccatggca 
cctggctttt 
tgtcttcttt 
caaaccagaa 
tgctgaagac 
agcacagaaa 
ctgtttcctg 
cctccagcat 
cctggccaat 
gcttctcatg 
gcagaccctg 
ctgttctact 
aatcactgga 
catatattct 
tgttgtcctc 
ggaaaatgag 



tttaaactca 
actcagtcct 
caattcactt 
gtgtatttta 
aaactcagag 
ctagcagtct 
agttaaagga 
gtgcagggcc 
tgaaagggga 
gggatcacct 
gggagattcc 
aatattgtgt 
ctggacaatg 
agaatctcca 
ccccagtggc 
gagccctgtg 
gagtgggacc 
gctgggtcac 
ttcgtgctca 
cccaccatcc 
attgtaaata 
ttgacattga 
gtcattcgag 
ctgttctcaa 

g&gggc t taa 

atcctaacca 
aagcattctc 
tcctttgtga 
ctgggaaaca 
tgtgttgcac 
ttcctactgg 
cgtgtggttt 
gcccaagaaa 
aactttgcta 
cgacccctgc 
ctccttcctg 
ggagtaaata 



acattccact 
agtccaaaaa 
tggattttct 
ggtcacgtgg 
atcaggatct 
ctttgccaaa 
cttggggaaa 
caccaagaag 
cattttctcc 
gttcagtctc 
agatttgtct 
tggagaactt 
acactgtgtc 
tcccactgga 
agctccttca 
tggatggctg 
tggtatgtga 
ttctgggagg 
gatggtctta 
tcgtatactg 
ctgttttgtt 
cactttgtgc 
accagtgcat 
ggtggctggc 
aggaacttag 
tggaggtttt 
tttgtgaatt 
gatttgcaag 
atgttttcct 
cttgggcact 
caacctgcct 
tggcaaccct 
atgaactaat 
atattggggg 
cctggatcat 
aaaccaggaa 
gcctagctgc 



<210> 63 

<211> 3196 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<223> Incyte ID No: 



7473473CB1 



<400> 63 

gcggcggccg 

atccgggtcg 

ctttcggctc 

tccccccctc 

caagagaggg 

tgaatcaagt 

tgacggtttt 

ttgcagtttt 

ttttgacaac 



ggggagcgct 

ggctggagac 
aatcttctct 
ccccgccgcg 
ctggtggcac 
ttcttactgg 
tgtaaactct 
atgtatgggg 
tacgaatcaa 



actaccatga 
acagtcaggg 
tccacctttt 
gatcctggcc 
cgcagaacac 
gaaatgccca 
ctggatatca 
aattgactga 
actgctttga 



actgcctggt 
gaccccgtcg 
cctcctc ttc 
gctgctctcc 
atttttggag 
gattgtggat 
tcgagctgac 
caagaagacc 
agttcttctg 



cctcctcccc 
ccgccgccgc 
ctccaccttc 
agacccagga 
aacatcgtca 
tggcctgtag 
gtcatgcaga 
attgagaaag 
tacaagaaaa 
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240 

300 

360 

420 

480 

540 
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720 
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1800 
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2040 

2100 

2160 

2220 

2245 



agagctgctc 
gccccctctt 
tttgcctgca 
tgccgggggg 
ggcgctccag 
tttatagtaa 
aaagcagcac 
tcaggcaaac 
acagaacccc 
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240 
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360 

420 
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540 
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tgtttggttt 
cctgtgtact 
aggt tggacg 
gcagctcacg 
tcttcagctg 
acacattatt 
taccttctac 
catagcctgg 
aaattttcac 
aaggatgaac 
catcatcaat 
agtggtgcgt 
tggagcagca 
ctgcatatgg 
aatagacagt 
cagtgctggg 
ctttaccatg 
ggagaagatg 
tggaaatgtt 
gctgaataat 
agtcatggat 
cctctccatc 
ttttaatgaa 
agagttccaa 
ggatgccctc 
ggctatttta 
ccatgcatgt 
agccttgctc 
cac tcttac t 
agaggaggag 
cccagtcaga 
aacacagggt 
agcctccatc 
tctggcctat 
caagcccaac 
gaatggaaat 
aaaggaagac 
caagagcagt 
tgacagtgga 
gctagagcac 
ggcc ttacag 
cage tgcaga 
aaaaagcgta 
gataccatgt 
tgaaatccac 



tatatgeaaa 
ttcaaggata 
aaatttgece 
ccaatgaata 
ggatcagata 
ttacattatt 
accgccatta 
ctggtactgg 
acgactttcg 
tatctgaaaa 
gectttgaaa 
ctcttacgac 
gtcctcgtgc 
tatagcatcg 
tggctctacc 
atatgggaag 
acaagectta 
ttttcggtgg 
acaacaattt 
gtaegggact 
tatattgtct 
tgtcccaagg 
catcctgctt 
accattcact 
tgctttgtgg 
gggaagggtg 
gcgaacgtcc 
aaagtcctgg 
tgcaatctga 
gage gec tec 
aagctcttcc 
gaccctgaga 
accggaacca 
gtgaaaacca 
ggcggtgctg 
ggaaaagggt 
tggaataatg 
gattcagaga 
attacaaaaa 
agtcccatcc 
accacactgc 
atgactgccc 
ccccaggcct 
caggatattt 
ttttaa 



ttgcaccaat 
ttacgttgtt 
gattgacacg 
aaacagaggt 
tccttcctca 
gtgcttttaa 
tggttcctta 
atagtgtggt 
tgg ggcc egg 
cttggtttgt 
atgtggatga 

tgggccgtgt 

tcctggtgtg 
gagactacga 
agctggcttt 
gaggacccag 
caaccatagg 
ctatgatgat 
tccagcaaat 
tcctaaaact 
caacatggtc 
acatgagagc 
ttcgattggc 
gtgctcccgg 
tgtcaggatc 
atgtatttgg 
gggcactgac 
acttttatac 
ggaaaeggat 
ggcagaagaa 
agaagttcaa 
ggaaccaact 
gcgtggtgac 
gtgaatccct 
accaaaaatg 
ggctgegact 
tcactaaagc 
acagtgtgac 
gtgaccttcg 
aggctgatgc 
aggaagtcaa 
tagaaaagca 
catctcccaa 
ttagtgtc tc 



aagaaatgaa 
caaacagcca 
ggctttgaca 
ggtccataaa 
gtataaacaa 
aac tact tgg 
taatgtttcc 
ggacgttatt 
tggagaggtc 
gatcgatctg 
gggaatcagc 
ggctaggaaa 
tgtgtttgga 
ggtcattgat 
gagcattggg 
caaggat tea 
atttggaaac 
ggttggcgct 
gtatgecaac 
ctatcaggtc 
catgtcaaaa 
tgatatctgt 
cagcgatggg 
ggacctcatt 
cttggaagtc 
agacatcttc 
gtactgtgac 
agettttgea 
catctttcgt 
tgaggtgacc 
gcagcagaag 
ccaggtagag 
tgtgtcacag 
taagcagaac 
tctcaaagtc 
caagaataat 
tgagtcaatg 
caaaaaccca 
tttggataag 
caagcacccc 
acacgaactc 
ggtggcagaa 
atcccaaatg 
aaggcctgaa 



catgaaaagg 
atagaggatg 
aatagccgaa 
cattcaagac 
gaagcgccaa 
gattgggtga 
ttcaaaacaa 
tttctggttg 
atttctgacc 
ctgtcttgtt 
agtctcttca 
ctggaccatt 
ctggtggccc 
gaagtcacta 
actccatatc 
ttgtacgtgt 
atagctccta 
cttctttatg 
accaaccgat 
cc aaaaggee 
ggcattgata 
gttcatctaa 
tgtctgcgcg 
taccatgetg 
atccaggatg 
tggaaggaaa 
ctacacatca 
aactccttct 
aagatcagtg 
ctcagcattc 
gagctgegga 
agccgctcct 
attactccca 
aaccgtgatg 
aacagcccaa 
atgggagccc 
gggctattgt 
ctaagaaaaa 
gctggggagg 
ttttatccca 
aaagaggaca 
attttaaaaa 
ccactccaag 
tcacctgaat 



tggtcttgtt 
attcaacaaa 
gtgttttgca 
tagctgaagt 
agacgccacc 
ttttaattct 
agcagaacaa 
acatcgtttt 
ctaagctcat 
taccttatga 
gttctttaaa 
acctagaata 
actggctggc 
acaccatcca 
gctacaatac 
cctctctcta 
ccacagatgt 
caactatttt 
accatgagat 
ttagtgagcg 
cagaaaaggt 
aceggaaggt 
ccttggcggt 
gagaaagtgt 
atgaggtggt 
ccacccttgc 
teaageggga 
caaggaatct 
atgtgaagaa 
ccgtggacca 
atcagggctc 
tacagaatgg 
ttcagacgtc 
ccatggaact 
taagaatgaa 
atgaggagaa 
ctgaggaccc 
cagattcttg 
cccgaagtcc 
tccccgagca 
tccagctgct 
tactgtcgga 
taccccccca 
ctgacaaaga 
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720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

"2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3196 



<210> 64 
<211> 1602 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



7477725CB1 



<400> 64 

atggcctttg 

ctggttttta 

gctgcagcca 

ggtaatgaaa 

tcaaatctga 

ctgaatggga 

gtatatgatc 

tatcagtcac 

atcataggtg 

ctccagcttg 

gtactacget 



aggagctctt 

ttcttccctc 
ttcctggtca 
ctggaatcct 
ggccagagaa 
ctatccacag 
aaagctactt 
tgaaatcagt 
gccatgtctc 
ccattactga 
tcttggcagg 



gagtcaagtt 
tctcatgtta 
tcgttgctgg 
cagtgaagat 
gtgtcgtcgc 
cacaagtgag 
cccttcgacc 
ggttcaattc 
agacaggttt 
cacctgcgct 
tttttcttcc 



ggaggccttg 
ttaatccctc 
gtccacatgc 
gccctcttga 
tttgtccatc 
gcagacacag 
attgtgacta 
ctacttctga 
gggegaagat 
gccttcgctc 
atgatcatta 



ggagatttca 
atatactget 
tggacaataa 
gaatctctat 
cccagtggca 
aaccctgtgt 
agtgggacct 
ctggaatgct 
ttattctcag 
ccaccttccc 
tatcaaataa 



gatgettcat 
agagaacttt 
tactggatct 
cccactagac 
gcttcttcac 
ggatggctgg 
ggtatgtgat 
ggtgggaggc 
atggtgtttg 
tgtttactgt 
ttctttgccc 
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attactgagt ggataaggcc caactctaaa 
cttagtattg gacagataat cctgggaggc 
ctgcacgtgg tggcgtctgt acctttcttt 
gaatctgctc ggtggttgat aatcaccaat 
aaagttgcac gcacaaatgg aataaagaat 
agatccacca tgcaggagga gctggatgca 
ttccgcaacc ccagtatgcg taaaaggatc 
acaatacctt tttatggtac catggtcaat 
ttgcaggtac tttatggagc tgtcgctctc 
aatcatatgg gccgtcgaat aagccagata 
ttggccaaca cgtttgtgcc caaagaaatg 
ggaatcggct gttctgctgc tactttttcc 
cccactgttc tcagggcaag agcttcagga 
gcactggctc ccctcttgat gaccttaacg 
tatggaatct tccccatcat tggtggcctt 
ctgcctttgc ctgacaccat caaggatgtg 



gccctggtag taatattgtc atctggtgcc 720 
ttggcttatg tcttccgaga ctggcaaacc 780 
gtcttctttc ttctttcaag gtggctggtg 840 
aaactagatg agggcttaaa ggcacttaga 9 00 
gctgaagaaa c cc tgaacat agaggttgta 9 60 
gcacagacca aaactactgt gtgtgacttg 1020 
tgtatcctgg tatttttgag atttgcaaac 1080 
cttcagcatg tggggagcaa cattttcctg 1140 
atagttcgat gtcttgctct tttgacacta 1200 
ttgttcatgt tcctggtggg cctttccatt 1260 
cagaccctgc gtgtggcttt ggcatgtctg 1320 
agtgttgctg ttcacttcat tgaactcatc 1380 
atagatttaa cggctagtag gattggagca 1440 
gtatttttta ccactttgcc atggatcatt 1500 
attgtcttcc tcctaccaga aaccaagaat 1560 
gaaaatcagt ga 1602 
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3. I J Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 
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This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 
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inventions in this international application, as follows: 
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Polypeptide and correpsonding nucleotide as defined by SEQ 
ID N0S:1 and 33 ,methods, hosts, compositions and antibodies 
based on said sequences 



Inventions 2-32: Claims 1-108 in so far as is 

applicable for the sequences as defined below 



Polypeptide and corresponding polynucleotide defined by SEQ 
ID N0S:2-32 and 34-64, each individual polypeptide sequence 
of 2 through 32 representing an individual invention in 
combination with the correpsonding polynucleotide sequence 
of 34 through 64, where invention 2 is represented by SEQ ID 
N0S:2 and 34 and each subsequent sequential pair 
representing another invention through to invention 32 
represented by SEQ ID N0S:32 and 

64;methods, hosts, compositions and antibodies based on said 
sequences. 
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Although claim 18 is directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 

Although claim(s)32 and 34 are directed to a diagnostic method practised 
on the human/animal body, the search has been carried out and based on 
the alleged effects of the compound/ compos i tion. 
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Present claims 20, 21, 23, and 24 relate to a product/compound/method 
defined by reference to a desirable characteristic or property, namely 
agonists and antagonists of the polypeptide of claim 1. 
The claims cover all products/compounds/methods having this 
characteristic or property, whereas the application does not provide 
support within the meaning of Article 6 PCT and/or disclosure within the 
meaning of Article 5 PCT. In the present case, the claims so lack 
support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt is made to define the product/compound/method by reference to a 
result to be achieved. Again, this lack of clarity in the present case is 
such as to render a meaningful search over the whole of the claimed 
scope impossible. Consequently, the search has not been carried out for 
claims 20,21,23, and 24 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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